NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and each of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species Neisseria meningitidis
and Neisseria gonorrhoeae.

BACKGROUND 

Neisseria meningitidis is a non-motile, Gram-negative diplococcus and a human pathogen. It
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of
meningitis. It is closely related to N. gonorrhoeae, although one feature that clearly
differentiates meningococcus from gonococcus is the polysaccharide capsule
that is present in all pathogenic meningococci.

N. meningitidis causes both endemic and epidemic disease. In the United States the
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks
(see Lieberman et al (1996) Safety and Immunogenicity of a Serogroups A/C Neisseria
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995.
N Engl J Med 337(14):970-976). In developing countries, endemic disease rates are much
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per
year. Mortality is extremely high, at 10-20% in the United States, and much higher in
developing countries. Following the introduction of the conjugate vaccine against
Haemophilus influenzae, N. meningitidis is the major cause of bacterial meningitis at all ages
in the United States (Schuchat et al (1997) supra).

Based on the organism's capsular polysaccharide, 12 serogroups of N. meningitidis
have been identified. Group A is the pathogen most often implicated in epidemic disease in
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the
United States and in most developed countries. Serogroups W135 and Y are responsible for
the rest of the cases in the United States and developed countries. The meningococcal vaccine
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and
W135. Although efficacious in adolescents and adults, it induces a poor immune response and
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality
Weekly Report, Vol. 46, No. RR-5 (1997)]. This is because polysaccharides are T-cell
independent antigens that induce a weak immune response that cannot be boosted by repeated
immunization. Following the success of the vaccination against H. influenzae, conjugate
vaccines against serogroups A and C have been developed and are at the final stage of clinical
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In:
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al
(1992) Development and phase I clinical testing of a conjugate vaccine against
meningococcus A and C. Vaccine 10:691-698).

Meningococcus B (menB) remains a problem, however. This serotype currently is
responsible for approximately 50% of total meningitis in the United States, Europe, and South
America. The polysaccharide approach cannot be used because the menB capsular
polysaccharide is a polymer of α(2-8)-linked N-acetyl neuraminic acid that is also present in
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of
autoimmunity and to induce a protective immune response, the capsular polysaccharide has,
for instance, been chemically modified by substituting the N-acetyl groups with N-propionyl
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin
Microbiol Rev 7(4):559-575).

Alternative approaches to menB vaccines have used complex mixtures of outer
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins,
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal
activity. This approach produces vaccines that are not well characterized. They are able to
protect against the homologous strain, but are not effective at large where there are many
antigenic variants of the outer membrane proteins. To overcome the antigenic variability,
multivalent vaccines containing up to nine different porins have been constructed (eg.
Poolman JT (1992) Development of a meningococcal vaccine. Infect. Agents Dis. 4:13-28).
Additional proteins to be used in outer membrane vaccines have been the opa and opc
proteins, but none of these approaches have been able to overcome the antigenic variability
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2
are both surface exposed and generate bactericidal antibodies capable of killing homologous
and heterologous strains. Vaccine 14(1):49-53).

A certain amount of sequence data is available for meningococcal and gonococcal
genes and proteins (e.g. EP-A-0467714, WO96/29412), but this is by no means complete.
Other menB proteins may include those listed in WO 97/28273, WO 96/29412, WO
95/03413, US 5,439,808, and US 5,879,686.

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N. meningitidis or N. gonorrhoeae that are more highly conserved are
further preferred sequences. 

It is thus an object of the invention to provide Neisserial DNA sequences which
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli.

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted
ORF 121-1 as cloned and expressed in E. coli.

Fig. 6 illustrates the products of protein expression and purification of the predicted
ORF 128-1 as cloned and expressed in E. coli.

Fig. 7 illustrates the products of protein expression and purification of the predicted
ORF 206 as cloned and expressed in E. coli.

Fig. 8 illustrates the products of protein expression and purification of the predicted
ORF 287 as cloned and expressed in E. coli.

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 919 as cloned and expressed in E. coli.

Fig. 11 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 279 as cloned and expressed in E. coli.

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 576-1 as cloned and expressed in E. coli.

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 519-1 as cloned and expressed in E. coli.

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 121-1 as cloned and expressed in E. coli.

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 128-1 as cloned and expressed in E. coli.

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 206 as cloned and expressed in E. coli.

Fig. 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 287 as cloned and expressed in E. coli.

Fig. 18 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the
products of protein expression of the predicted ORF 406 as cloned and expressed in E. coli.

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure
demonstrates a high degree of conservation among the various strains, further confirming its
utility as an antigen for both vaccines and diagnostics.

Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N. meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the N. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith-Waterman
homology search algorithm as implemented in the MPSRCH program (Oxford Molecular) using
an affine gap search with parameters: gap penalty 12, gap extension penalty 1.
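
To make the identity criterion concrete, the following is a minimal sketch of a Smith-Waterman local alignment with affine gaps, followed by a percent-identity calculation. It is not the MPSRCH implementation. Several choices here are assumptions made only for illustration: a simple match/mismatch score stands in for a real amino acid substitution matrix, a gap of length k is charged 12 + (k - 1) x 1, and identity is taken as identical columns divided by the length of the local alignment. The function names are hypothetical.

```python
def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Local alignment with affine gaps (Gotoh recurrences).

    Assumption: a gap of length k costs gap_open + (k - 1) * gap_extend.
    Returns (score, aligned_a, aligned_b).
    """
    n, m = len(a), len(b)
    neg = float("-inf")
    H = [[0] * (m + 1) for _ in range(n + 1)]    # best local score ending at (i, j)
    E = [[neg] * (m + 1) for _ in range(n + 1)]  # alignment ends with a gap in a
    F = [[neg] * (m + 1) for _ in range(n + 1)]  # alignment ends with a gap in b
    best, best_pos = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(E[i][j - 1] - gap_extend, H[i][j - 1] - gap_open)
            F[i][j] = max(F[i - 1][j] - gap_extend, H[i - 1][j] - gap_open)
            s = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell, tracking which matrix we are in.
    i, j = best_pos
    state, out_a, out_b = "H", [], []
    while i > 0 and j > 0:
        if state == "H":
            if H[i][j] == 0:
                break  # start of the local alignment
            s = match if a[i - 1] == b[j - 1] else mismatch
            if H[i][j] == H[i - 1][j - 1] + s:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i, j = i - 1, j - 1
            elif H[i][j] == E[i][j]:
                state = "E"
            else:
                state = "F"
        elif state == "E":
            out_a.append("-")
            out_b.append(b[j - 1])
            if E[i][j] == H[i][j - 1] - gap_open:
                state = "H"  # this column opened the gap
            j -= 1
        else:
            out_a.append(a[i - 1])
            out_b.append("-")
            if F[i][j] == H[i - 1][j] - gap_open:
                state = "H"  # this column opened the gap
            i -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical columns as a percentage of alignment length (one common convention)."""
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)


# Hypothetical toy sequences, not taken from the examples.
score, top, bottom = smith_waterman_affine("ACDEFGHIKLMNPQ", "ACDEFGKLMNPQ")
print(score, top, bottom, round(percent_identity(top, bottom), 1))
```

Under this convention a pair of sequences would meet the "greater than 50%" criterion when `percent_identity` exceeds 50. Other definitions of identity (for example, dividing by the length of the shorter sequence) give different figures, so the denominator should be stated whenever a value is reported.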

The invention further provides proteins comprising fragments of the N. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and,
depending on the particular sequence, n is 7 or more (eg. 8, 10, 12, 14, 16, 18, 20 or more).
Preferably the fragments comprise an epitope from the sequence. 
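
The "at least n consecutive amino acids" condition can be illustrated with a short sketch that enumerates every window of n residues in a reference sequence and checks whether a candidate protein contains any of them. The function names and the peptide strings are hypothetical and are not sequences from the examples.

```python
def fragments(sequence, n=7):
    """Yield every run of n consecutive residues in `sequence`."""
    if n < 1:
        raise ValueError("n must be positive")
    for start in range(len(sequence) - n + 1):
        yield sequence[start:start + n]


def comprises_fragment(candidate, reference, n=7):
    """True if `candidate` contains at least n consecutive residues of `reference`."""
    return any(frag in candidate for frag in fragments(reference, n))


# Hypothetical 10-residue reference and a candidate that embeds 7 of its residues.
reference = "MKTAYIAKQR"
print(list(fragments(reference, 7)))
print(comprises_fragment("GGKTAYIAKGG", reference, 7))
```

A sequence of length L yields L - n + 1 windows, so a reference shorter than n yields none and the check returns False.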

The proteins of the invention can, of course, be prepared by various means (eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms (eg. native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form (ie. substantially free from other N. meningitidis or N. gonorrhoeae host cell proteins).

According to a further aspect, the invention provides antibodies which bind to these
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and N. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences is also provided. These should
comprise at least n consecutive nucleotides from the N. meningitidis sequences or N.
gonorrhoeae sequences and, depending on the particular sequence, n is 10 or more (eg. 12, 14,
15, 18, 20, 25, 30, 35, 40 or more).

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above (eg. for antisense or probing purposes).
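
As an illustration of what a complementary sequence is, the sketch below derives the reverse complement of a DNA strand written 5' to 3'. The function name and the example oligonucleotide are hypothetical, and only the four unambiguous bases are handled.

```python
# Translation table for Watson-Crick pairing; other characters pass through unchanged.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


# Hypothetical probe sequence.
probe = "ATGGCC"
print(reverse_complement(probe))
```

Applying the function twice returns the original strand, which is a convenient sanity check when designing probes or antisense sequences.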

Nucleic acid according to the invention can, of course, be prepared in many ways (eg.
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the
organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors,
probes etc.).

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also peptide nucleic acids (PNA) etc.

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(i) a medicament for treating or preventing infection due to Neisserial bacteria; (ii) a
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised
against Neisserial bacteria; or (iii) a reagent for raising antibodies. Said Neisserial bacteria may be any
species or strain (such as N. gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic acid probe according to the invention with a biological sample
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of antibody-antigen complexes; and (b) detecting
said complexes.

Having now generally described the invention, the same will be more readily
understood through reference to the following examples which are provided by way of
illustration, and are not intended to be limiting of the present invention, unless specified.

Methodology - Summary of standard procedures and techniques
General 

This invention provides Neisseria meningitidis menB nucleotide sequences and the amino
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 
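
The search for open reading frames mentioned above can be sketched as follows. This is a simplified illustration, not the procedure used to identify the ORFs in the examples: it scans only the three forward frames, treats ATG as the sole start codon, and uses the standard genetic code. The function names, the `min_codons` threshold and the test sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def find_orfs(dna, min_codons=3):
    """Yield (start, end, protein) for ATG...stop ORFs in the three forward frames.

    `start` is the index of the A in ATG; `end` is one past the stop codon.
    """
    dna = dna.upper()
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and CODON_TABLE.get(codon) == "*":
                if (i - start) // 3 >= min_codons:
                    yield start, i + 3, translate(dna[start:i])
                start = None


# Hypothetical toy sequence with one short ORF in the third frame.
print(list(find_orfs("CCATGGCTAAAGGTTAAGG")))
```

A fuller search would also scan the reverse complement strand and consider the alternative start codons used by bacteria; both are omitted here to keep the sketch short.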

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning: A Laboratory Manual, Second Edition (1989);
DNA Cloning, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait
ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription
and Translation (B.D. Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide to
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.),
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987)
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes
(1987) Protein Purification: Principles and Practice, Second Edition (Springer-Verlag,
N.Y.); and Handbook of Experimental Immunology, Volumes I-IV (D.M. Weir and C.C.
Blackwell eds. 1986).

Standard abbreviations for nucleotides and amino acids are used in this specification.

All publications, patents, and patent applications cited herein are incorporated in full
by reference.

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5' 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual 2nd ed.). 
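
The spatial relationship described above (a TATA box roughly 25-30 bp upstream of the transcription initiation site) can be checked on a candidate promoter with a short sketch. The motif pattern TATA[AT]A[AT] is a commonly quoted consensus and is used here as an assumption; the distance is measured from the first base of the motif to the initiation site, which is also an assumption. The function name and the test sequence are hypothetical.

```python
import re

# Lookahead so that overlapping motif occurrences are all reported.
_TATA = re.compile(r"(?=TATA[AT]A[AT])")


def tata_offsets(seq, tss, min_up=25, max_up=30):
    """Return distances (bp upstream of index `tss`) at which a TATA-like motif starts."""
    hits = []
    for m in _TATA.finditer(seq.upper()):
        distance = tss - m.start()
        if min_up <= distance <= max_up:
            hits.append(distance)
    return hits


# Hypothetical promoter: motif begins at index 10, initiation site at index 38.
promoter = "G" * 10 + "TATAAAA" + "G" * 21 + "ACGT"
print(tata_offsets(promoter, tss=38))
```

An empty result does not show that a promoter is non-functional; it only means no TATA-like motif was found in the stated window under these assumptions.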

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promoters may be inducible using
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a
regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1237; Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79:6777) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 41:521). Additionally, some enhancers are regulable and become
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by
mammalian cells are regulatory regions located 3' to the translation stop codon and thus,
together with the promoter elements, flank the coding sequence. The 3' terminus of the
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation
(Birnstiel et al. (1985) Cell 41:349; Proudfoot and Whitelaw (1988) "Termination and 3' end
processing of eukaryotic RNA." In Transcription and splicing (ed. B.D. Hames and D.M.
Glover); Proudfoot (1989) Trends Biochem. Sci. 14:105). These sequences direct the
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA.
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 23:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the replicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufman et al. (1989) Mol Cell Biol 9:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol 6:1074).

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines.

ii. Plant Cellular Expression Systems

There are many plant cell culture and whole plant genetic expression systems known
in the art. Exemplary plant cellular genetic expression systems include those described in
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of
genetic expression in plant cell culture have been described by Zenk, Phytochemistry 30:3861-3863
(1991). Descriptions of plant protein signal peptides may be found, in addition to the
references described above, in Vaulcombe et al., Mol. Gen. Genet. 209:33-40 (1987); Chandler
et al., Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol. Chem. 260:3731-3738
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research
15:2515-2535 (1987); Wirsel et al., Molecular Microbiology 3:3-14 (1989); Yu et al., Gene
122:247-253 (1992). A description of the regulation of plant gene expression by the
phytohormone gibberellic acid, and of secreted enzymes induced by gibberellic acid, can be
found in R.L. Jones and J. MacMillin, Gibberellins, in: Advanced Plant Physiology, Malcolm
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that
describe other metabolically-regulated genes: Sheen, Plant Cell 2:1027-1038 (1990); Maas et
al., EMBO J. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl. Acad. Sci. 84:1337-1339
(1987).

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant Mol. Biol. Reptr, 11(2):165-185.

Sequences suitable for permitting integration of the heterologous sequence into the
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain, in addition to the heterologous protein encoding sequence, the following
elements: a promoter region, plant 5' untranslated sequences, an initiation codon depending upon
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of the 
cassette allow for easy insertion into a pre-existing vector. 
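
The requirement that the flanking restriction sites be unique can be checked computationally before cloning. The sketch below is an illustration under stated assumptions: the three enzymes and their recognition sequences (EcoRI GAATTC, BamHI GGATCC, HindIII AAGCTT) are chosen only as familiar examples, the cassette is assumed to begin and end exactly at the sites, and the function names and sequences are hypothetical.

```python
import re

# Recognition sequences of three common enzymes; all are palindromic,
# so scanning one strand is sufficient.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}


def count_sites(seq, site):
    """Count occurrences of `site` in `seq`, including overlapping ones."""
    return len(re.findall(f"(?={site})", seq.upper()))


def flanking_sites_are_unique(cassette, five_prime, three_prime):
    """True if the cassette starts and ends with the named sites and each occurs once."""
    left, right = SITES[five_prime], SITES[three_prime]
    s = cassette.upper()
    return (
        s.startswith(left)
        and s.endswith(right)
        and count_sites(s, left) == 1
        and count_sites(s, right) == 1
    )


# Hypothetical cassettes: the second carries an internal EcoRI site.
good = "GAATTC" + "ATGGCTAAAGGTTAA" + "GGATCC"
bad = "GAATTC" + "ATGGAATTCTAA" + "GGATCC"
print(flanking_sites_are_unique(good, "EcoRI", "BamHI"))
print(flanking_sites_are_unique(bad, "EcoRI", "BamHI"))
```

In practice the same count would also be run against the recipient vector, since a site that is unique within the cassette may still occur elsewhere in the vector backbone.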

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically, secretion in seeds is across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eukaryotic cell, it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's spliceosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code (Reed and Maniatis, Cell 41:95-105, 1985). 



The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, Mol. Gen. Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et al., Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al., Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid-surfaced bodies, Fraley, et al., Proc. Natl. Acad. Sci. USA 
79:1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation (Fromm et 
al., Proc. Natl. Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hemerocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browallia, Glycine, Lolium, Zea, Triticum, 
and Sorghum. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or, alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will then be used to purify the recombinant protein. 
Parameters of time, temperature, pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene into the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 
those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its own set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 17:31). 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann. Rev. Microbiol., 42:177) and a prokaryotic ampicillin-resistance (amp) gene and 
origin of replication for selection and propagation in E. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5' to 3') transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
Publ. Nos. 127 839 and 155 476; and the gene encoding the p10 protein, Vlak et al., (1988), J. 
Gen. Virol 69:165. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 73:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human α- 
interferon, Maeda et al., (1985), Nature 315:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell. Biol. 8:3129; human IL-2, Smith et al., (1985) Proc. 
Nat'l Acad. Sci. USA, 82:8404; mouse IL-3, Miyajima et al., (1987) Gene 58:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus - usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus are known in the art. (See 
Summers and Smith supra; Ju et al. (1987); Smith et al., Mol. Cell. Biol. (1983) 3:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al. (1989) 
Bioessays 4:91. The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 µm in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra; Miller et al. (1989). 
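
Because recombinants are stated above to make up only about 1% to about 5% of the progeny virus, it can help to estimate how many plaques would need to be screened. The following Python sketch is an assumption-laden illustration, not part of the described method: it treats each plaque as an independent draw with a fixed recombinant fraction f, so that the chance of at least one recombinant among n plaques is 1 - (1 - f)^n. The function name plaques_needed and the 95% confidence default are hypothetical.

```python
import math


def plaques_needed(recomb_fraction, confidence=0.95):
    """Smallest n with P(at least one recombinant plaque among n) >= confidence.

    Assumes independent plaques with a constant recombinant fraction.
    """
    if not 0 < recomb_fraction < 1 or not 0 < confidence < 1:
        raise ValueError("fraction and confidence must lie strictly between 0 and 1")
    return math.ceil(math.log(1 - confidence) / math.log(1 - recomb_fraction))


# The two ends of the frequency range given in the text.
for f in (0.01, 0.05):
    print(f, plaques_needed(f))
```

Under these assumptions roughly 300 plaques are needed at a 1% recombination frequency and roughly 60 at 5%, which is why the visual occlusion-body screen is a practical advantage.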

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti, Autographa californica, Bombyx mori, Drosophila melanogaster, 
Spodoptera frugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al. 
(1985) J. Virol. 56:153; Wright (1986) Nature 321:718; Smith et al., (1983) Mol. Cell. Biol. 
3:2156; and see generally, Fraser, et al. (1989) In Vitro Cell. Dev. Biol. 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g., Summers and Smith supra. 

The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, which may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negatively regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present, is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et al. (1984) 
Annu. Rev. Genet. 18:113). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et al. (1977) Nature 198:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et al. (1980) Nuc. Acids Res. 8:4057; Yelverton et al. (1981) Nucl. 
Acids Res. 9:131; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bla) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes." In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et al. (1981) 
Nature 292:128) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et al. (1983) Gene 25:161; de Boer et 
al. (1983) Proc. Natl. Acad. Sci. 80:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et al. (1986) J. Mol. Biol. 
189:113; Tabor et al. (1985) Proc Natl. Acad. Sci. 82:1014). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. In E. coli, the ribosome binding 
site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the initiation codon 
(Shine et al. (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al. (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al. (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 
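
The geometry described above (an SD element of 3-9 nucleotides lying 3-11 nucleotides upstream of the ATG) can be checked on a candidate construct with a short Python sketch. This is an illustration under stated assumptions: the consensus TAAGGAGGT is taken here as the complement of the 3' end of E. coli 16S rRNA, the gap is counted as the number of nucleotides between the last SD base and the A of the ATG, and the function name find_sd and the test sequence are hypothetical.

```python
CONSENSUS = "TAAGGAGGT"  # assumed SD consensus, complementary to the 16S rRNA 3' end


def find_sd(seq, atg_index, min_len=3, max_len=9, min_gap=3, max_gap=11):
    """Return (start, motif, gap) for the longest consensus fragment that ends
    min_gap..max_gap nucleotides upstream of the ATG at atg_index, else None."""
    seq = seq.upper()
    # Every consensus substring of the allowed length, longest first.
    motifs = sorted(
        {CONSENSUS[i:j]
         for i in range(len(CONSENSUS))
         for j in range(i + min_len, min(len(CONSENSUS), i + max_len) + 1)},
        key=lambda m: (-len(m), m),
    )
    for motif in motifs:
        for gap in range(min_gap, max_gap + 1):
            start = atg_index - gap - len(motif)
            if start >= 0 and seq[start:start + len(motif)] == motif:
                return start, motif, gap
    return None


# Hypothetical construct: cloning site, AGGAGG, six-nucleotide spacer, ATG.
construct = "GGATCCAGGAGGAATTCAATGGCT"
print(find_sd(construct, construct.find("ATG")))
```

A result of None, or a gap near either limit of the window, would suggest that the spacing between the SD sequence and the ATG is a candidate for optimization.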

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 
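
When cyanogen bromide is considered for removing the N-terminal methionine, it is worth checking the sequence for internal methionines first, since cyanogen bromide cleaves on the carboxyl side of methionine residues wherever they occur. The Python sketch below illustrates that check; the function name cnbr_fragments and both peptide sequences are hypothetical examples, not proteins of the invention.

```python
def cnbr_fragments(protein):
    """Split a one-letter protein sequence after every methionine (M)."""
    fragments, current = [], ""
    for residue in protein.upper():
        current += residue
        if residue == "M":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


# Only an N-terminal Met: the remainder of the protein stays in one piece.
print(cnbr_fragments("MKTAYIAK"))
# An internal Met as well: the protein itself is cut into two pieces.
print(cnbr_fragments("MKTMAYIAK"))
```

Where more than two fragments are predicted, the enzymatic route using a methionine N-terminal peptidase may be the better fit.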

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al. (1984) Nature 309:810). Fusion 
proteins can also be made with sequences from the lacZ (Jia et al. (1987) Gene 60:197), trpE 
(Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al. (1989) J. Gen. Microbiol. 135:11), and 
CheY (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et al. (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
signal sequence fragment usually encodes a signal peptide composed of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro, encoded between the signal peptide 
fragment and the foreign gene. 
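
The hydrophobic character of a candidate signal peptide can be gauged with a sliding-window hydropathy average. The Python sketch below uses the Kyte-Doolittle hydropathy scale; the scale, the seven-residue window, the function name most_hydrophobic_window, and the example peptide are all choices made for illustration and are not specified in this disclosure.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydropathy(peptide):
    """Average Kyte-Doolittle value over a peptide."""
    return sum(KD[aa] for aa in peptide.upper()) / len(peptide)


def most_hydrophobic_window(peptide, window=7):
    """Return (start, mean) of the most hydrophobic window of the given width."""
    scores = [(mean_hydropathy(peptide[i:i + window]), i)
              for i in range(len(peptide) - window + 1)]
    best_mean, best_start = max(scores, key=lambda s: (s[0], -s[1]))
    return best_start, best_mean


# Hypothetical leader: charged N-terminus, hydrophobic core, small residues at the end.
leader = "MKKTALLLAVLLAGSAQA"
print(most_hydrophobic_window(leader))
```

A strongly positive window in the middle of the leader, flanked by a charged N-terminal region, is the pattern one would expect of the hydrophobic signal peptides described above.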

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al. 
(1983), in: Experimental Manipulation of Gene Expression; Ghrayeb et al. (1984) EMBO J. 
3:2437) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et al. (1985) Proc. 
Natl. Acad. Sci. 82:1212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 
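
A stem loop of the kind mentioned above arises where a stretch of sequence is followed, after a short loop, by its own reverse complement. The Python sketch below searches a DNA sequence for such inverted repeats. It is a simplified illustration: it requires perfect base pairing, ignores folding energetics, and the stem length, loop range, function names, and test sequence are hypothetical choices.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_stem_loops(seq, stem=7, min_loop=3, max_loop=8):
    """List (start, stem_sequence, loop_sequence) for perfect inverted repeats."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        left = seq[i:i + stem]
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            right = seq[j:j + stem]
            # The right arm must be full length and pair with the left arm.
            if len(right) == stem and right == revcomp(left):
                hits.append((i, left, seq[i + stem:j]))
    return hits


# Hypothetical test sequence: GC-rich inverted repeat followed by a run of T residues.
candidate = "AGCCCGCCTAATGAGCGGGCTTTTTTTT"
print(find_stem_loops(candidate))
```

A real terminator search would normally allow mismatches and score stem stability, so hits from this sketch are starting points rather than confirmed terminators.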

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et al (1978) Annu. Rev. Microbiol 32:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al. (1981) Nature 
292:128; Amann et al. (1985) Gene 40:183; Studier et al. (1986) J. Mol. Biol. 189:113; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et al. (1988) Appl. 
Environ. Microbiol. 54:655), Streptococcus lividans (Powell et al. (1988) Appl. Environ. 
Microbiol. 54:655), and Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include the transformation of bacteria treated with CaCl2 or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al. (1989) FEMS Microbiol. Lett. 60:213; 
Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al. (1988) Proc. Natl. Acad. 
Sci. 85:856; and Wang et al. (1990) J. Bacteriol. 172:949; use of Escherichia coli: Cohen et 
al. (1973) Proc. Natl. Acad. Sci. 69:2110; Dower et al. (1988) Nucleic Acids Res. 16:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColE1- 
derived plasmids." In Genetic Engineering: Proceedings of the International Symposium on 
Genetic Engineering (eds. H. W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol. Biol. 
53:159; Taketo (1988) Biochim. Biophys. Acta 949:318; use of Lactobacillus: Chassy et al. 
(1987) FEMS Microbiol. Lett. 44:113; use of Pseudomonas: Fiedler et al. (1988) Anal. 
Biochem. 170:38; use of Staphylococcus: Augustin et al. (1990) FEMS Microbiol. Lett. 
66:203; use of Streptococcus: Barany et al. (1980) J. Bacteriol. 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation," in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss III); Perry et al. (1981) Infect. Immun. 32:1295; Powell et al. (1988) 
Appl. Environ. Microbiol. 54:655; Somkuti et al. (1987) Proc. 4th Eur. Cong. Biotechnology). 

v. Yeast Expression 



Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5' 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway; therefore, sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PHO5 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al. (1983) Proc. Natl. 
Acad. Sci. USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either the ADH2, GAL4, GAL10, or PHO5 genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, those described in Cohen et al. (1980) Proc. Natl. Acad. 
Sci. USA 77:1078; Henikoff et al. (1981) Nature 283:835; Hollenberg et al. (1981) Curr. Topics 
Microbiol. Immunol. 96:119; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic 
Resistance Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical, Environmental 
and Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercereau-Puigalon et al. (1980) 
Gene 11:163; and Panthier et al. (1980) Curr. Genet. 2:109. 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
superoxide dismutase (SOD) gene can be linked at the 5' terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62,096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of 
non-yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alpha factor (see e.g., PCT Publ. No. WO 89/02463). 

Usually, transcription termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al. 
(1979) Gene 8:17-24), pCl/1 (Brake et al. (1984) Proc. Natl. Acad. Sci. USA 81:4642-4646), 
and YRp17 (Stinchcomb et al. (1982) J. Mol. Biol. 158:151). In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably have at least about 10 copies, and more 
preferably at least about 20. Either a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al., supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr-Weaver 
et al. (1983) Methods in Enzymol. 101:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr-Weaver et al., supra. One or more expression constructs may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al. (1983) Proc. 
Natl Acad. Sci. USA 80:6150). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al. (1987) Microbiol. Rev. 51:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz et al. (1986) Mol. 
Cell. Biol. 6:142); Candida maltosa (Kunze et al. (1985) J. Basic Microbiol. 25:141); 
Hansenula polymorpha (Gleeson et al. (1986) J. Gen. Microbiol. 132:3459; Roggenkamp et 
al. (1986) Mol. Gen. Genet. 202:302); Kluyveromyces fragilis (Das et al. (1984) J. Bacteriol. 
158:1165); Kluyveromyces lactis (De Louvencourt et al. (1983) J. Bacteriol. 154:737; Van 
den Berg et al. (1990) Bio/Technology 8:135); Pichia guilliermondii (Kunze et al. (1985) J. 
Basic Microbiol. 25:141); Pichia pastoris (Cregg et al. (1985) Mol. Cell. Biol. 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et al. (1978) Proc. 
Natl. Acad. Sci. USA 75:1929; Ito et al. (1983) J. Bacteriol. 153:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:706); and Yarrowia lipolytica (Davidow et al. 
(1985) Curr. Genet. 10:39; Gaillardin et al. (1985) Curr. Genet. 10:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al. (1986) Mol. Cell. Biol. 6:142; Kunze et al. (1985) J. Basic 
Microbiol. 25:141; Candida]; [Gleeson et al. (1986) J. Gen. Microbiol. 132:3459; 
Roggenkamp et al. (1986) Mol. Gen. Genet. 202:302; Hansenula]; [Das et al. (1984) J. 
Bacteriol. 158:1165; De Louvencourt et al. (1983) J. Bacteriol. 154:737; Van den Berg et al. 
(1990) Bio/Technology 8:135; Kluyveromyces]; [Cregg et al. (1985) Mol. Cell. Biol. 5:3376; 
Kunze et al. (1985) J. Basic Microbiol. 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75:1929; Ito et al. (1983) J. 
Bacteriol. 153:163; Saccharomyces]; [Beach and Nurse (1981) Nature 300:706; 
Schizosaccharomyces]; [Davidow et al. (1985) Curr. Genet. 10:39; Gaillardin et al. (1985) 
Curr. Genet. 10:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of" Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 
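
The weight-based threshold above reduces to a simple ratio. The following is a minimal sketch in Python; the function names and the example masses are illustrative assumptions and do not appear in the specification.

```python
def fraction_x(mass_x: float, mass_y: float) -> float:
    """Return the weight fraction of X in the total X+Y (same mass units for both)."""
    total = mass_x + mass_y
    if total <= 0:
        raise ValueError("total mass of X+Y must be positive")
    return mass_x / total


def substantially_free(mass_x: float, mass_y: float, threshold: float = 0.85) -> bool:
    """True when X makes up at least `threshold` of X+Y by weight.

    The default of 0.85 follows the 85% figure in the definition; pass
    0.90, 0.95 or 0.99 to apply the preferred, stricter levels.
    """
    return fraction_x(mass_x, mass_y) >= threshold


# Hypothetical example: 9 mg of X with 1 mg of Y.
example = substantially_free(9.0, 1.0)
```

Only the masses of X and Y enter the calculation; other components of the composition are ignored, consistent with the definition being stated relative to the total of X+Y.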

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria. The value of x may be 50% or more, 
e.g., 66%, 75%, 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an amino acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 
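
The comparison described above can be sketched as a per-position tally over a reference population. The Python below is an illustration only: it assumes the protein sequences have already been aligned so that one position corresponds across strains, and the function names and the five-strain example column are hypothetical.

```python
from typing import Sequence


def percent_with_residue(column: Sequence[str], residue: str) -> float:
    """Percentage of reference sequences carrying `residue` at one aligned position.

    `column` holds one residue per strain in the reference population.
    """
    if not column:
        raise ValueError("reference population is empty")
    hits = sum(1 for r in column if r == residue)
    return 100.0 * hits / len(column)


def is_conserved(column: Sequence[str], residue: str, x: float = 50.0) -> bool:
    """True when `residue` is present in at least x% of the reference population."""
    return percent_with_residue(column, residue) >= x


# Hypothetical column: the same position in five reference strains.
column = ["A", "A", "A", "G", "A"]
conserved_at_75 = is_conserved(column, "A", x=75.0)
```

The choice of reference population (several species, one species, or one serogroup) changes only the contents of `column`, not the calculation.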

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1x SSC, 0.5% SDS solution. 

An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5' or 3' untranslated regions of the gene, such as in regulatory 
control regions (see, for example, U.S. Patent 5,753,235). 
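
As a rough illustration of comparing a mutant sequence against the greater-than-50% threshold, the following Python sketch counts identical positions in two sequences that have already been aligned. It is a simplified stand-in and is not the calculation the specification refers to as described earlier; the function name, the gap character and the example sequences are assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Columns where both sequences have a gap are skipped; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Hypothetical native and mutant fragments, already aligned.
identity = percent_identity("ACGTACGT", "ACGTTCGT")
meets_threshold = identity > 50.0
```

Different alignment methods and gap treatments give different identity values, so any threshold comparison is meaningful only with the alignment procedure fixed.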

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins, or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 µg/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera are obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifugation (e.g., 1,000 x g for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
(Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32P and 125I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3',5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, 125I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with 125I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/kg 
to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 
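
Because the dose ranges above are stated per kilogram of body weight, the corresponding absolute amounts scale linearly with the weight of the individual. The Python sketch below shows only that arithmetic; the function name and the 70 kg example weight are illustrative assumptions, and the result is not a dosing recommendation.

```python
from typing import Tuple


def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float = 0.01,
                  high_mg_per_kg: float = 50.0) -> Tuple[float, float]:
    """Convert a per-kilogram dose range into absolute milligrams.

    The defaults follow the broader range in the text (about 0.01 mg/kg
    to 50 mg/kg); pass 0.05 and 10.0 for the narrower range.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("lower bound exceeds upper bound")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Hypothetical 70 kg individual, broader and narrower ranges.
broad = dose_range_mg(70.0)
narrow = dose_range_mg(70.0, low_mg_per_kg=0.05, high_mg_per_kg=10.0)
```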

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. 



Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 



Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, or other pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
microfluidizer such as Model 110Y microfluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphoryl lipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 
(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 
(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IFA); 
(5) cytokines, such as interleukins (e.g., IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, 
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), 
N-acetyl-normuramyl-L-alanyl-D-isoglutamine (nor-MDP), 
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine 
(MTP-PE), etc. 

The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutically acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutically acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al. (1997) Annu. 
Rev. Immunol. 15:617-648). 

Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non-viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) J. 
Virol. 53:160)), polytropic retroviruses (e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol. 
45:291)), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
WO96/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 

Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC No. VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP0415731, EP0345242, EP0334301, 
WO89/02468, WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
WO93/25698, WO93/25234, WO93/11230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53:83-88; Takamiya (1992) J Neurosci Res 
33:493-503; Baba (1993) J Neurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy 1. 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in WO94/12649, WO93/03769, WO93/19191, 
WO94/28938, WO95/11984, WO95/00655, WO95/27071, WO95/29993, WO95/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, WO94/24299, WO95/14102, 
WO95/24297, WO95/02697, WO94/28152, WO94/24299, WO95/09241, WO95/25807, 
WO95/05835, WO94/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum. Gene Ther. 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adeno-associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19 and pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol. 61:3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chatterjee US 
Patent 5,474,935, and Kotin WO94/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241:1667-1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
described in Fink (1992) Human Gene Therapy 3:11-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis virus vectors. Also suitable are togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC 
VR-923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309 and 5,217,879, and in WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995, WO94/21792, WO92/10578, 
WO95/07994, US 5,091,309 and US 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukaryotic layered expression systems are also useful for 
expressing the nucleic acids of the invention. See WO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans (1989) Nature 339:385 
and Sabin (1973) J. Biol. Standardization 1:115; rhinovirus, for example ATCC VR-1110 and 
those described in Arnold (1990) J Cell Biochem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-111 and ATCC VR-2010 and those described in 
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,112 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:2711-2713 and Luytjes (1989) Cell 59:110 (see also McMichael (1983) NEJ Med 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J. Virol 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; O'Nyong virus; Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre (1966) Proc Soc Exp Biol Med 121:190. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors; polycationic condensed DNA linked or unlinked to 
killed adenovirus alone (for example, see US Serial No. 08/366,787, filed December 30, 1994, 
and Curiel (1992) Hum Gene Ther 3:147-154); ligand linked DNA (for example, see Wu (1989) 
J Biol Chem 264:16985-16987); eucaryotic cell delivery vehicle cells (for example, see US 
Serial No. 08/240,030, filed May 9, 1994, and US Serial No. 08/404,796); deposition of 
photopolymerized hydrogel materials; hand-held gene transfer particle gun, as described in 
US Patent 5,149,655; ionizing radiation, as described in US 5,206,152 and in WO92/11033; and 
nucleic charge neutralization or fusion with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:2411-2418 and in Woffendin (1994) Proc Natl 
Acad Sci 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
Wu (1987) J. Biol. Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/11092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in US 5,422,120, 
WO95/13796, WO94/23697, WO91/14445 and EP-524,968. As described in USSN 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non-viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proc. Natl. Acad. Sci. USA 91(24):11581-11585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and WO92/11033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; in WO 95/13796; WO94/23697; and WO91/14445; in 
EP-0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:119; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
(1989) Anal Biochem 176:420. 









A polynucleotide composition can comprise a therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in, e.g., WO93/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoietic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A. Polypeptides 

One example is polypeptides, which include, without limitation: asialoorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons; granulocyte-macrophage colony stimulating factor (GM-CSF); 
granulocyte colony stimulating factor (G-CSF); macrophage colony stimulating factor 
(M-CSF); stem cell factor; and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Proteins from other invasive organisms can also be used, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 
B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 
C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethylene glycol. In 
addition, mono-, di-, or polysaccharides can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Chitosan and 
poly(lactide-co-glycolide) can also be included. 
D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; 
Straubinger (1983) Meth. Enzymol. 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Felgner (1987) Proc. Natl. 
Acad. Sci. USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl. Acad. Sci. USA 
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N-[1-(2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Felgner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
DOTAP/DOPE (Boehringer). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, e.g., Szoka (1978) Proc. Natl. Acad. Sci. 
USA 75:4194-4198; WO90/11092 for a description of the synthesis of DOTAP 
(1,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphosphatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See, e.g., Straubinger (1983) Meth. 
Enzymol. 101:512-527; Szoka (1978) Proc. Natl. Acad. Sci. USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77; Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl. Acad. Sci. USA 76:3348; Enoch & Strittmatter 
(1979) Proc. Natl. Acad. Sci. USA 76:145; Fraley (1980) J. Biol. Chem. 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl. Acad. Sci. USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
E. Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are included with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portions are known as apoproteins. At present, apoproteins A, B, C, D, and E have been 
isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, AI, AII, AIV; CI, CII, CIII. 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprise A, B, C, and E apoproteins; over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins; LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid sequences of these apoproteins are known and are described in, for example, 
Breslow (1985) Annu Rev. Biochem 54:699; Law (1986) Adv. Exp Med. Biol. 151:162; Chen 
(1986) J Biol Chem 261:12918; Kane (1980) Proc Natl Acad Sci USA 77:2465; and Utermann 
(1984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids, including triglycerides, cholesterol (free and 
esters), and phospholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymol. 128 (1986). The composition of the lipids is chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for 
instance. Such methods are described in Meth. Enzymol. (supra); Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Technologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al., PCT. Appln 
No. US97/14465. 
F. Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 

Polycationic agents typically exhibit a net positive charge at physiologically relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyarginine, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, and coat 
proteins from DNA viruses, such as φX174. Transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic acid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and putrescine. 

The dimensions and the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. Lipofectin™ and lipofectAMINE™ are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostic methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
a suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt's reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et al. [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 12° to 20°C 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et al. at page 9.50. 
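
As a small illustration of the offset described above, the following Python sketch computes the temperature window lying a chosen number of degrees below a calculated Tm. It assumes the offset range is 12 to 20 °C below Tm; the function name and default values are illustrative only and form no part of the specification.

```python
def hybridization_window(tm_celsius, min_offset=12.0, max_offset=20.0):
    """Return (lowest, highest) temperature in deg C, i.e. the window lying
    between max_offset and min_offset degrees below the calculated Tm."""
    if not 0 <= min_offset <= max_offset:
        raise ValueError("offsets must satisfy 0 <= min_offset <= max_offset")
    return (tm_celsius - max_offset, tm_celsius - min_offset)


# Example: a hybrid with a calculated Tm of 70 deg C
low, high = hybridization_window(70.0)
print(f"candidate window: {low:.1f} to {high:.1f} deg C")
```

In practice the window is only a starting point, since the conditions are often determined empirically in preliminary experiments.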

Variables to consider when performing, for example, a Southern blot are (1) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 10, from 0.1 to 1 μg for a plasmid or phage digest to 10⁻⁹ to 10⁻⁸ g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
yeast gene can be detected with an exposure time of only 1 hour starting with 1 μg of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10⁸ cpm/μg. For a 
single-copy mammalian gene a conservative approach would start with 10 μg of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10⁸ cpm/μg, resulting in an exposure time of ~24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm = 81 + 16.6(log₁₀ Ci) + 0.4[%(G + C)] - 0.6(%formamide) - 600/n - 1.5(%mismatch) 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal Biochem. 138: 267-284). 
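
The equation can be evaluated directly. The following Python sketch is one possible transcription of it, in which Ci is taken in mol/L and the remaining terms as percentages; the function and parameter names are illustrative assumptions and do not appear in the cited reference.

```python
import math


def melting_temperature(salt_molar, gc_percent, formamide_percent,
                        length_bp, mismatch_percent=0.0):
    """Approximate Tm (deg C) of a DNA-DNA hybrid:
    Tm = 81 + 16.6*log10(Ci) + 0.4*(%G+C) - 0.6*(%formamide)
         - 600/n - 1.5*(%mismatch)
    """
    if salt_molar <= 0:
        raise ValueError("salt concentration must be positive")
    if length_bp <= 0:
        raise ValueError("hybrid length must be positive")
    return (81.0
            + 16.6 * math.log10(salt_molar)   # monovalent salt term
            + 0.4 * gc_percent                # G+C content term
            - 0.6 * formamide_percent         # formamide destabilization
            - 600.0 / length_bp               # length correction
            - 1.5 * mismatch_percent)         # mismatch penalty


# Example: 1 M monovalent salt, 50% G+C, 50% formamide, 600 bp hybrid
print(round(melting_temperature(1.0, 50.0, 50.0, 600), 1))
```

For the example inputs the terms are 81 + 0 + 20 - 30 - 1 = 70, so the estimate is about 70°C; each 1% of mismatch lowers it by a further 1.5°C.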

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (i.e. stringency), it becomes less likely for hybridization to occur
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide
are 42°C for a probe which is 95% to 100% homologous to the target fragment, 37°C for 90%
to 95% homology, and 32°C for 85% to 90% homology. For lower homologies, formamide
content should be lowered and temperature adjusted accordingly, using the equation above. If
the homology between the probe and the target fragment is not known, the simplest
approach is to start with both hybridization and wash conditions which are nonstringent. If
non-specific bands or high background are observed after autoradiography, the filter can be
washed at high stringency and reexposed. If the time required for exposure makes this
approach impractical, several hybridization and/or washing stringencies should be tested in
parallel.

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) - some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5' end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with a Neisserial sequence in order to hybridize therewith and thereby form a duplex
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template.

Probes may be produced by synthetic procedures, such as the triester method of
Matteucci et al. [J. Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al. [Proc.
Natl Acad. Sci. USA (1983) 80: 7461], or using commercially available automated
oligonucleotide synthesizers.

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be
incorporated, eg. backbone modifications such as phosphorothioates or methylphosphonates
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc.
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997)
TIBTECH 15:224-229; Buchardt et al. (1993) TIBTECH 11:384-386].

One example of a nucleotide hybridization assay is described by Urdea et al. in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

The polymerase chain reaction (PCR) is another well-known means for
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic
acids has been generated by the polymerase, they can be detected by more traditional methods, such
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra]. mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
containing the labeled probe are detected. Typically, the probe is labelled with a radioactive
moiety. 



EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparison of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis

• the putative translation product of said N. meningitidis strain A sequence 

• a comparison of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg. 
see also Altschul et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:3389-3402]. Searches were performed
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed. Lower case letters represent ambiguities which arose during
alignment of independent sequencing reactions (some of the nucleotide sequences in the
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et al 
[Critical evaluation of the hydropathy of membrane proteins (1990) Eur J Biochem 190:207-219].
These domains represent potential transmembrane regions or hydrophobic leader
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 
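The idea of scanning all six reading frames for open reading frames can be sketched as follows. This is an illustration only and is not the ORFFINDER algorithm: it assumes that ORFs start at ATG, end at the first in-frame TAA, TAG or TGA, and contain only A, C, G and T. The function names and the `min_codons` threshold are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case ACGT sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=30):
    """List ORFs as (strand, frame, start, end) tuples.

    start and end are 0-based offsets on the scanned strand; end is
    exclusive and includes the stop codon. min_codons counts the codons
    from ATG up to, but not including, the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i                      # open a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3))
                    start = None                   # look for the next ATG
    return orfs


# Toy example: a 6-codon ORF in frame 2 of the forward strand.
example = "CC" + "ATG" + "GCT" * 5 + "TAA" + "G"
print(find_orfs(example, min_codons=3))
```

A real analysis of fragmented sequences would also need to handle ORFs that run off the end of a fragment and alternative start codons, which this sketch ignores.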

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and N. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention (e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention.

Chromosomal DNA Preparation

N. meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium,
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris-HCl,
50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by
adding 10ml of lysis solution (50mM NaCl, 1% Na-Sarkosyl, 50µg/ml Proteinase K), and the
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and
one CHCl3/isoamylalcohol (24:1) extraction were performed. DNA was precipitated by
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation.
The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer
(10mM Tris-HCl, 1mM EDTA, pH 8.0). The DNA concentration was measured by reading
the OD at 260 nm.
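A reading at 260 nm is commonly converted to a concentration with the rule of thumb that one absorbance unit corresponds to roughly 50 µg/ml of double-stranded DNA in a 1 cm path. The sketch below assumes that convention; the conversion factor, the dilution factor and the example reading are assumptions for illustration, not values taken from the procedure above.

```python
def dsdna_conc_ug_per_ml(a260, dilution_factor=1.0, ug_per_ml_per_a260=50.0):
    """Estimate dsDNA concentration (µg/ml) from an OD260 reading.

    Assumes a 1 cm path length and ~50 µg/ml per absorbance unit.
    dilution_factor is how many-fold the sample was diluted before reading.
    """
    return a260 * dilution_factor * ug_per_ml_per_a260


# Hypothetical reading: OD260 of 0.2 on a 20-fold dilution.
conc = dsdna_conc_ug_per_ml(0.2, dilution_factor=20)
total_ug = conc * 4.0   # DNA was redissolved in 4.0 ml TE buffer
print(conc, total_ug)
```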

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites
(BamHI-NdeI, BamHI-NheI, EcoRI-NdeI or EcoRI-NheI), depending on the restriction pattern
of the gene of interest. The 3' primers included a XhoI or a HindIII restriction site (table 1).
This procedure was established in order to direct the cloning of each amplification product
(corresponding to each ORF) into two different expression systems: pGEX-KG (using
BamHI-XhoI, BamHI-HindIII, EcoRI-XhoI or EcoRI-HindIII), and pET21b+ (using NdeI-XhoI,
NheI-XhoI, NdeI-HindIII or NheI-HindIII).

5'-end primer tail:  CGCGGATCCCATATG    (BamHI-NdeI)
                     CGCGGATCCGCTAGC    (BamHI-NheI)
                     CCGGAATTCTACATATG  (EcoRI-NdeI)
                     CCGGAATTCTAGCTAGC  (EcoRI-NheI)

3'-end primer tail:  CCCGCTCGAG         (XhoI)
                     CCCGCTCGAG         (HindIII)

For cloning ORFs into the pGEX-His vector, the 5' and 3' primers contained only one
restriction enzyme site (EcoRI, KpnI or SalI for the 5' primers and PstI, XbaI, SphI or SalI for
the 3' primers). Again restriction sites were chosen according to the particular restriction
pattern of the gene (table 1).

5'-end primer tail:         (AAA)AAAGAATTC  (EcoRI)
                            (AAA)AAAGGTACC  (KpnI)
3'-end primer tail:         (AAA)AAACTGCAG  (PstI)
                            (AAA)AAATCTAGA  (XbaI)
                            AAAGCATGC       (SphI)
5'- or 3'-end primer tail:  AAAAAAGTCGAC    (SalI)

As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 

Tm = 4 (G+C) + 2 (A+T) (tail excluded)

Tm = 64.9 + 0.41 (% GC) - 600/N (whole primer)

The melting temperatures of the selected oligonucleotides were usually 65-70°C for
the whole oligo and 50-55°C for the hybridising region alone.
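The two melting temperature formulae can be applied to a primer as in the sketch below. The function names are illustrative, and the hybridising region used in the example is a made-up sequence chosen only to show the arithmetic; the tail is the BamHI-NdeI tail listed above.

```python
def tm_hybridising_region(seq):
    """Tm = 4(G+C) + 2(A+T), applied to the hybridising region (tail excluded)."""
    s = seq.upper()
    gc = s.count("G") + s.count("C")
    at = s.count("A") + s.count("T")
    return 4 * gc + 2 * at


def tm_whole_primer(seq):
    """Tm = 64.9 + 0.41(%GC) - 600/N, applied to the whole primer of length N."""
    s = seq.upper()
    n = len(s)
    gc_percent = 100.0 * (s.count("G") + s.count("C")) / n
    return 64.9 + 0.41 * gc_percent - 600.0 / n


tail = "CGCGGATCCCATATG"            # BamHI-NdeI 5'-end tail
region = "GCTGAAACCGGTCAAGC"        # hypothetical hybridising region
print(tm_hybridising_region(region))     # 54
print(tm_whole_primer(tail + region))
```

The lower value would govern the first 5 amplification cycles and the higher value the remaining cycles, as described in the amplification protocol.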

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5' and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningococcus A. Hence, when the
Meningococcus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH4OH, and deprotected by 5 hours incubation at 56°C. 
The oligos were precipitated by addition of 0.3M Na-Acetate and 2 volumes ethanol. The
samples were centrifuged and the pellets resuspended in either 100µl or 1.0ml of water. The
OD260 was determined using a Perkin Elmer Lambda Bio spectrophotometer and the
concentration adjusted to 2-10pmol/µl.

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a
template in the presence of 20-40µM of each oligonucleotide primer, 400-800µM dNTPs
solution, 1x PCR buffer (including 1.5mM MgCl2), 2.5 units TaqI DNA polymerase (using
Perkin-Elmer AmpliTaq, GIBCO Platinum, Pwo DNA polymerase, or Takara Shuzo Taq
polymerase). In some cases, PCR was optimised by the addition of 10µl DMSO or 50µl 2M
Betaine.



After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows:

                  Denaturation       Hybridisation        Elongation
First 5 cycles    30 seconds, 95°C   30 seconds, 50-55°C  30-60 seconds, 72°C
Last 30 cycles    30 seconds, 95°C   30 seconds, 65-70°C  30-60 seconds, 72°C

Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1.5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1.0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fragments were 
eluted in a volume of 30µl or 50µl with either H2O or 10mM Tris, pH 8.5.

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with
the appropriate restriction enzymes for: cloning into pET-21b+ and expressing the protein as a
C-terminus His-tagged fusion, for cloning into pGEX-KG and expressing the protein as a
N-terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a
N-terminus GST-His tagged fusion.

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40µl in the presence of suitable digestion buffer. Digested fragments were purified using the
QIAquick PCR purification kit (following the manufacturer's instructions) and eluted in a
volume of 30µl or 50µl with either H2O or 10mM Tris, pH 8.5. The DNA concentration was
determined by quantitative agarose gel electrophoresis (1.0% gel) in the presence of a titrated 
molecular weight marker. 



Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX-KG,
and pGEX-His)

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 µg plasmid was double-digested with 50 units of
each restriction enzyme in 200 µl reaction volume in the presence of appropriate buffer by
overnight incubation at 37°C. After loading the whole digestion on a 1% agarose gel, the
band corresponding to the digested vector was purified from the gel using the Qiagen
QIAquick Gel Extraction Kit and the DNA was eluted in 50 µl of 10 mM Tris-HCl, pH 8.5.
The DNA concentration was evaluated by measuring OD260 of the sample, and adjusted to 50
µg/µl. 1 µl of plasmid was used for each cloning procedure.

10µg of plasmid vector was doubly-digested with 50 units of each restriction enzyme
in a volume of 200µl with the appropriate buffer overnight at 37°C. The digest was loaded
onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50µl of 10mM Tris-HCl, pH 8.5.
The DNA concentration was evaluated by measuring OD260nm and the concentration adjusted
to 50µg/µl. 1µl of plasmid was used for each cloning procedure.

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 µl, a molar
ratio of 3:1 fragment/vector was ligated using 0.5 µl of NEB T4 DNA ligase (400 units/µl), in
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room
temperature for 3 hours. In some experiments, ligation was performed using the Boehringer
"Rapid Ligation Kit", following the manufacturer's instructions.
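A 3:1 fragment/vector molar ratio translates into a mass of insert that depends on the relative lengths of the two molecules, since the mass of a DNA molecule is proportional to its length. The sketch below shows this arithmetic; the vector mass and the fragment and vector lengths in the example are hypothetical values, not figures from the protocol.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, molar_ratio=3.0):
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    Molar amounts scale as mass / length, so:
    insert_ng = vector_ng * (insert_bp / vector_bp) * molar_ratio
    """
    return vector_ng * (insert_bp / vector_bp) * molar_ratio


# Hypothetical example: 50 ng of a 5000 bp vector and a 1000 bp fragment.
print(insert_mass_ng(50.0, 5000, 1000))
```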

In order to introduce the recombinant plasmid in a suitable strain, 100 µl E.coli DH5
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at
37°C for 3 minutes, then, after adding 800 µl LB broth, again at 37°C for 20 minutes. The
cells were then centrifuged at maximum speed in an Eppendorf microfuge and resuspended in
approximately 200 µl of the supernatant. The suspension was then plated on LB ampicillin
(100mg/ml).

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37°C in either 2 ml (pGEX or pTC clones) or 5ml
(pET clones) LB broth + 100 µg/ml ampicillin. The cells were then pelleted and the DNA
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's
instructions, to a final volume of 30 µl. 5 µl of each individual miniprep (approximately 1 µg)
were digested with either NdeI/XhoI or BamHI/XhoI and the whole digestion loaded onto a
1-1.5% agarose gel (depending on the expected insert size), in parallel with the molecular
weight marker (1Kb DNA Ladder, GIBCO). The screening of the positive clones was made
on the basis of the correct insert size.

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of 3:1
fragment/vector was used in a final volume of 20µl, that included 0.5µl T4 DNA ligase (400
units/µl, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed
at room temperature for 3 hours. In some experiments, ligation was performed using the
Boehringer "Rapid Ligation Kit" and the manufacturer's protocol.

Recombinant plasmid was transformed into 100µl of competent E.coli DH5 or
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at
37°C for 3 minutes. This was followed by the addition of 800µl LB broth and incubation at
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf
microfuge, resuspended in approximately 200µl of the supernatant and plated onto LB
ampicillin (100mg/ml) agar.

Screening for recombinant clones was performed by growing 5 randomly selected
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB
broth + 100µg/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions.
Approximately 1µg of each individual miniprep was digested with the appropriate restriction
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert
size), in parallel with the molecular weight marker (1kb DNA Ladder, GIBCO). Positive
clones were selected on the basis of the size of insert.

ORFs were cloned into pGEX-His, by doubly-digesting the PCR product and ligating
into similarly digested vector. After cloning, recombinant plasmids were transformed into the
E.coli host W3110. Individual clones were grown overnight at 37°C in LB broth with 50µg/ml
ampicillin.

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRI-PstI cloning
sites, or EcoRI-SalI, or SalI-PstI. After cloning, the recombinant plasmids may be introduced
in the E.coli host W3110.

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 µl of each construct was used to
transform 30 µl of E.coli BL21 (pGEX vector), E.coli TOP10 (pTRC vector) or E.coli BL21-DE3
(pET vector), as described above. In the case of the pGEX-His vector, the same E.coli
strain (W3110) was used for initial cloning and expression. Single recombinant colonies were
inoculated into 2ml LB+Amp (100 µg/ml), incubated at 37°C overnight, then diluted 1:30 in
20 ml of LB+Amp (100 µg/ml) in 100 ml flasks, making sure that the OD600 ranged between
0.1 and 0.15. The flasks were incubated at 30°C in gyratory water bath shakers until the OD
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX-His
vectors, protein expression was induced by addition of 1mM IPTG, whereas in the
case of the pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation
at 30°C, the final concentration of the sample was checked by OD. In order to check
expression, 1ml of each sample was removed, centrifuged in a microfuge, the pellet
resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The
whole sample was centrifuged at 6000g and the pellet resuspended in PBS for further use.

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on an LB+Amp agar plate.
The bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and
grown overnight. Bacteria were diluted 1:30 into 600 ml of fresh medium and allowed to
grow at the optimal temperature (20-37°C) to OD550 0.8-1. Protein expression was induced
with 0.2mM IPTG followed by three hours incubation. The culture was centrifuged at 8000
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson
sonifier B-15, frozen and thawed two times and centrifuged again. The supernatant was
collected and mixed with 150µl Glutathione-Sepharose 4B resin (Pharmacia) (previously
washed with PBS) and incubated at room temperature for 30 minutes. The sample was
centrifuged at 700g for 5 minutes at 4°C. The resin was washed twice with 10 ml cold PBS for
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was
washed twice with 2ml cold PBS until the flow-through reached OD280 of 0.02-0.06. The
GST-fusion protein was eluted by addition of 700µl cold Glutathione elution buffer (10mM
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD280 was 0.1. 21µl of
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular
weight standard broad range (M1) (200, 116.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or
Amersham Rainbow Marker (M2) (220, 66, 46, 30, 21.5, 14.3 kDa) as standards. As the MW
of GST is 26kDa, this value must be added to the MW of each GST-fusion protein.

For other ORFs, for each clone to be purified as a GST-fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp (100µg/ml) agar plate. An isolated
colony from this plate was inoculated into 20ml of LB/Amp (100 µg/ml) liquid medium and
grown overnight at 37°C with shaking. The overnight culture was diluted 1:30 into 600ml
LB/Amp (100µg/ml) liquid medium and allowed to grow at the optimal temperature
(20-37°C) until the OD550nm reached 0.6-0.8. Recombinant protein expression was induced by
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours.
Bacteria were harvested by centrifugation at 8000xg for 15 min at 4°C.

The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at
13000xg for 30 min at 4°C. The supernatant was collected and mixed with 150µl
Glutathione-Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifuged at
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a
disposable column. The resin continued to be washed with cold PBS, until the OD280nm of the
flow-through reached 0.02-0.01. The GST-fusion protein was eluted by addition of 700µl cold
glutathione elution buffer (10mM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions
collected, until the OD280nm of the eluate indicated all the recombinant protein was obtained.
20µl aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The
molecular mass of the purified proteins was determined using either the Bio-Rad broad range
molecular weight standard (M1) (200, 116, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular
weights of GST-fusion proteins are a combination of the 26 kDa GST protein and its fusion
partner. Protein concentrations were estimated using the Bradford assay.
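When reading a gel, the expected position of a GST-fusion band can be estimated by adding the 26 kDa of GST to the mass of the fusion partner. The sketch below approximates the partner mass from its length, assuming an average of about 110 Da per amino acid residue; that average and the example length are assumptions for illustration and are not taken from the text.

```python
GST_KDA = 26.0  # mass of the GST moiety, as stated above


def gst_fusion_kda(partner_residues, avg_residue_da=110.0):
    """Rough expected mass (kDa) of a GST-fusion protein.

    partner_residues: number of amino acids in the fusion partner.
    avg_residue_da: assumed average residue mass in Da (approximation).
    """
    return GST_KDA + partner_residues * avg_residue_da / 1000.0


# Hypothetical 300-residue fusion partner.
print(gst_fusion_kda(300))
```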

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated
overnight in a water bath shaker. Bacteria were diluted 1:30 into 600ml fresh medium and
allowed to grow at the optimal temperature (20-37°C) to OD550 0.6-0.8. Protein expression
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial
pellet was resuspended in 7.5ml cold 10mM imidazole buffer (300 mM NaCl, 50 mM
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged
again. The supernatant was collected and mixed with 150µl Ni2+-resin (Pharmacia)
(previously washed with 10mM imidazole buffer) and incubated at room temperature with
gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at 4°C.
The resin was washed twice with 10 ml cold 10mM imidazole buffer for 10 minutes,
resuspended in 1ml cold 10mM imidazole buffer and loaded on a disposable column. The
resin was washed at 4°C with 2ml cold 10mM imidazole buffer until the flow-through
reached the OD280 of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the
flow-through reached the OD280 of 0.02-0.06. The His-fusion protein was eluted by addition of
700µl cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM
imidazole, pH 8) and fractions collected until the OD280 was 0.1. 21µl of each fraction were
loaded on a 12% SDS gel.

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37°C on a LB + Amp agar plate. The
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and
grown overnight. Bacteria were diluted 1:30 into 600ml fresh medium and allowed to grow at the
optimal temperature (37°C) to OD550 0.6-0.8. Protein expression was induced by addition
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifuged
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in
7.5 ml buffer B (urea 8M, 10mM Tris-HCl, 100mM phosphate buffer, pH 8.8). The cells
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B-15, frozen
and thawed twice and centrifuged again. The supernatant was stored at -20°C, while the
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, 100mM
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The
product was centrifuged at 13000 rpm for 40 minutes. The supernatant was mixed with 150µl
Ni2+-resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature
with gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml
buffer B, and loaded on a disposable column. The resin was washed at room temperature
with 2ml buffer B until the flow-through reached an OD280 of 0.02-0.06. The resin was then
washed with 2ml buffer C (urea 8M, 10mM Tris-HCl, 100mM phosphate buffer, pH 6.3) until
the flow-through reached an OD280 of 0.02-0.06. The His-fusion protein was eluted by
addition of 700µl elution buffer (urea 8M, 10mM Tris-HCl, 100mM phosphate buffer, pH
4.5) and fractions collected until the OD280 was 0.1. 21µl of each fraction were loaded on a
12% SDS gel.

Purification of His-fusion proteins.

For each clone to be purified as a His-fusion, a single colony was streaked out and
grown overnight at 37°C on a LB/Amp (100 µg/ml) agar plate. An isolated colony from this
plate was inoculated into 20ml of LB/Amp (100 µg/ml) liquid medium and grown overnight
at 37°C with shaking. The overnight culture was diluted 1:30 into 600ml LB/Amp (100
µg/ml) liquid medium and allowed to grow at the optimal temperature (20-37°C) until the
OD550nm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG
(final concentration 1.0mM) and the culture incubated for a further 3 hours. Bacteria were
harvested by centrifugation at 8000xg for 15 min at 4°C.

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM
NaCl, 50mM phosphate buffer, 10mM imidazole, pH 8.0) for soluble proteins or (ii) buffer B
(8M urea, 10mM Tris-HCl, 100mM phosphate buffer, pH 8.8) for insoluble proteins. Cells
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450
and centrifuged at 13000xg for 30 min at 4°C. For insoluble proteins, pellets were
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, 100mM phosphate buffer,
10mM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 10 cycles. The
homogenate was centrifuged at 13000xg for 40 min and the supernatant retained.

Supernatants for both soluble and insoluble preparations were mixed with 150µl Ni2+-resin
(previously equilibrated with either buffer A or buffer B, as appropriate) and incubated
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast
Flow (Pharmacia), prepared according to the manufacturer's protocol. The batch-wise preparation
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer
A or B and loaded onto a disposable column. The resin continued to be washed with either (i)
buffer A at 4°C or (ii) buffer B at room temperature, until the OD280nm of the flow-through
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300mM NaCl,
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, 10mM Tris-HCl,
100mM phosphate buffer, pH 6.3) until the OD280nm of the flow-through reached
0.02-0.01. The His-fusion protein was eluted by addition of 700µl of either (i) cold elution
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution
buffer B (8M urea, 10mM Tris-HCl, 100mM phosphate buffer, pH 4.5) and fractions
collected until the OD280nm indicated all the recombinant protein was obtained. 20µl aliquots
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations
were estimated using the Bradford assay.

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step
was employed prior to immunization. Glycerol was added to the denatured fractions obtained
above to give a final concentration of 10% (v/v). The proteins were diluted to 200µg/ml using
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v)
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C.

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were
then diluted to 20µg/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8)
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM
reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C.
Protein concentration was evaluated using the formula:

Protein (mg/ml) = (1.55 x OD280) - (0.76 x OD260)
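
The formula above can be sketched as a small helper. This is only an illustration: the function name and the example absorbance readings are hypothetical and do not come from the experiments described here.

```python
def protein_mg_per_ml(od280: float, od260: float) -> float:
    """Estimate protein concentration (mg/ml) from absorbance readings.

    Implements: Protein (mg/ml) = (1.55 x OD280) - (0.76 x OD260)
    """
    return 1.55 * od280 - 0.76 * od260


# Hypothetical readings, chosen only to show the arithmetic.
example = protein_mg_per_ml(od280=0.50, od260=0.30)
print(f"{example:.3f} mg/ml")
```

The OD260 term corrects the OD280 reading for absorbance by contaminating nucleic acid, so a sample with a high OD260 gives a lower estimate.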

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in
500µl buffer M1 (PBS pH 7.2). 25µl of lysozyme (10mg/ml) was added and the bacteria
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at
40W using a Branson sonifier 450 and centrifuged at 13000xg for 30 min at 4°C. The
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl,
20mM imidazole and 0.1M NaH2PO4] and incubated for 3 to 4 hours at 4°C. After
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1M NaH2PO4] overnight at 4°C. The
supernatants from all steps were analysed by SDS-PAGE. Some proteins were found to be
soluble in PBS; others needed urea or guanidinium-HCl for solubilization.

For preparative scale purifications, 500ml cultures were induced and fusion proteins
solubilized in either buffer M1, M2 or M3 using the procedure described above. Crude
extracts were loaded onto a Ni-NTA superflow column (Qiagen) equilibrated with buffer
M1, M2 or M3 depending on the solubilization buffer employed. Unbound material was
eluted by washing the column with the same buffer. The recombinant fusion protein was
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the
same buffer in the absence of imidazole.

Mice immunisations 

20µg of each purified protein were used to immunise mice intraperitoneally. In the case
of some ORFs, Balb-C mice were immunised with Al(OH)3 as adjuvant on days 1, 21 and 42,
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were
immunised using Freund's adjuvant, and the same immunisation protocol was used, except
that the immune response was measured on day 42, rather than 56. Similarly, for still other
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was
measured on day 49. Alternatively, 20µg of each purified protein was mixed with Freund's
adjuvant and used to immunise CD1 mice intraperitoneally. For many of the proteins, the
immunization was performed on days 1, 21 and 35, and immune response was monitored in
samples taken on days 34 and 49. For some proteins, the third immunization was performed
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather
than 34 and 49.

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile
dacron swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25%
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The
bacteria were allowed to grow until the OD reached the value of 0.3-0.4. The culture was
centrifuged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated
for 2 hours at room temperature and then overnight at 4°C with stirring. 100µl bacterial cells
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200µl of
saturation buffer (2.1% Polyvinylpyrrolidone 10 in water) was added to each well and the
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200µl of
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN3 in PBS) were added to
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times
with PBT. 100µl of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1:2000 in
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37°C.
Wells were washed three times with PBT buffer. 100µl of substrate buffer for HRP (25 ml of
citrate buffer pH5, 10 mg of O-phenylenediamine and 10µl of H2O2) were added to each well and
the plates were left at room temperature for 20 minutes. 100µl H2SO4 was added to each
well and OD490 was measured. The ELISA was considered positive when OD490 was 2.5
times that of the respective pre-immune sera.

Alternatively, the acapsulated MenB M7 strain was plated on chocolate agar plates
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using
a sterile dacron swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25%
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The bacteria
were allowed to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10
minutes at 10000rpm. The supernatant was discarded and bacteria were washed once with
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C
and then overnight at 4°C with stirring. 100µl bacterial cells were added to each well of a 96
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times
with PBT washing buffer (0.1% Tween-20 in PBS). 200µl of saturation buffer (2.7%
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours
at 37°C. Wells were washed three times with PBT. 200µl of diluted sera (Dilution buffer: 1%
BSA, 0.1% Tween-20, 0.1% NaN3 in PBS) were added to each well and the plates incubated
for 2 hours at 37°C. Wells were washed three times with PBT. 100µl of HRP-conjugated
rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution buffer were added to each well and
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT
buffer. 100µl of substrate buffer for HRP (25ml of citrate buffer pH5, 10mg of
O-phenylenediamine and 10µl of H2O2) were added to each well and the plates were left at room
temperature for 20 minutes. 100µl of 12.5% H2SO4 was added to each well and OD490 was
measured. The ELISA titers were calculated arbitrarily as the dilution of sera which gave an
OD490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive
when the dilution of sera with OD490 of 0.4 was higher than 1:400.
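
The two positivity criteria described above can be sketched as follows. The text does not say how the dilution giving an OD490 of 0.4 above preimmune was located between tested dilutions, so the log-linear interpolation used here is an assumption, as are the function names and the example readings.

```python
import math


def positive_by_ratio(od_immune: float, od_preimmune: float, factor: float = 2.5) -> bool:
    """First criterion: OD490 at least `factor` times the pre-immune OD490."""
    return od_immune >= factor * od_preimmune


def elisa_titer(dilutions, od_immune, od_preimmune, cutoff=0.4):
    """Reciprocal dilution at which (OD490 - preimmune) falls to `cutoff`.

    `dilutions` are reciprocal dilutions in increasing order (100 means 1:100).
    Interpolation between neighbouring dilutions is log-linear (an assumption).
    Returns None if even the first dilution is below the cutoff.
    """
    net = [od - od_preimmune for od in od_immune]
    if net[0] < cutoff:
        return None
    for i in range(len(net) - 1):
        if net[i] >= cutoff > net[i + 1]:
            frac = (net[i] - cutoff) / (net[i] - net[i + 1])
            span = math.log(dilutions[i + 1]) - math.log(dilutions[i])
            return math.exp(math.log(dilutions[i]) + frac * span)
    return float(dilutions[-1])  # still above cutoff at the last dilution tested


def positive_by_titer(titer, threshold=400) -> bool:
    """Second criterion: titer higher than 1:400."""
    return titer is not None and titer > threshold


# Hypothetical readings for one serum.
dils = [100, 200, 400, 800, 1600]
ods = [1.6, 1.2, 0.9, 0.6, 0.3]
titer = elisa_titer(dils, ods, od_preimmune=0.1)
print(round(titer), positive_by_titer(titer))
```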

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile
dacron swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco)
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following
OD620. The bacteria were allowed to grow until the OD reached the value of 0.35-0.5. The culture
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN3) and centrifuged for 5 minutes
at 4000rpm. Cells were resuspended in blocking buffer to reach OD620 of 0.07. 100µl bacterial
cells were added to each well of a Costar 96 well plate. 100µl of diluted (1:100, 1:200, 1:400)
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C.
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed
by addition of 200µl/well of blocking buffer. 100µl of R-Phycoerythrin conjugated
F(ab)2 goat anti-mouse, diluted 1:100, was added to each well and plates incubated for 1 hour
at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and washed by
addition of 200µl/well of blocking buffer. The supernatant was aspirated and cells
resuspended in 200µl/well of PBS, 0.25% formaldehyde. Samples were transferred to
FACScan tubes and read. The FACScan settings (Laser Power 15mW) were: FL2
on; FSC-H threshold: 92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2
PMT: 586; compensation values: 0.

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the
bacteria disrupted by sonication for 10 minutes on ice (50% duty cycle, 50% output). Unbroken
cells were removed by centrifugation at 5000g for 10 minutes and the total cell envelope
fraction recovered by centrifugation at 50000g at 4°C for 75 minutes. To extract cytoplasmic
membrane proteins from the crude outer membranes, the whole fraction was resuspended in
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was
centrifuged at 10000g for 10 minutes to remove aggregates, and the supernatant further
ultracentrifuged at 50000g for 75 minutes to pellet the outer membranes. The outer
membranes were resuspended in 10mM Tris-HCl, pH 8 and the protein concentration
measured by the Bio-Rad Protein assay, using BSA as a standard.

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes.

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5µg) and total cell extracts
(25µg) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at
4°C, using transfer buffer (0.3% Tris base, 1.44% glycine, 20% (v/v) methanol). The
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed
milk, 0.1% Triton X100 in PBS). The membrane was washed twice with washing buffer (3%
skimmed milk, 0.1% Triton X100 in PBS) and incubated for 2 hours at 37°C with mouse sera
diluted 1:200 in washing buffer. The membrane was washed twice and incubated for 90
minutes with a 1:2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The
membrane was washed twice with 0.1% Triton X100 in PBS and developed with the
Opti-4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water.

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was
incubated at 37°C on a nutator and allowed to grow until OD620 was between 0.5 and 0.8. The
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and
resuspended in the same buffer to an OD620 of 0.5, diluted 1:20000 in Gey's buffer and stored
at 25°C.

50µl of Gey's buffer/1% BSA was added to each well of a 96-well tissue culture plate.
25µl of diluted (1:100) mouse sera (dilution buffer: Gey's buffer/0.2% BSA) were added to
each well and the plate incubated at 4°C. 25µl of the previously described bacterial
suspension were added to each well. 25µl of either heat-inactivated (56°C waterbath for 30
minutes) or normal baby rabbit complement were added to each well. Immediately after the
addition of the baby rabbit complement, 22µl of each sample/well were plated on
Mueller-Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation
and then 22µl of each sample/well were plated on Mueller-Hinton agar plates (time 1). After
overnight incubation the colonies corresponding to time 0 and time 1h were counted.

Gene Variability 

The ORF4 and 919 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf 4.1 (forward)   CGAATCCGGACGGCAGGACTC
orf 4.3 (reverse)   GGCAGGGAATGGCGGATTAAAG
919.1 (forward)     AAAATGCCTCTCCACGGCTG or
                    CTGCGCCCTGTGTTAAAATCCCCT
919.6 (reverse)     CAAATAAGAAAGGAATTTTG or
                    GGTATCGCAAAACTTCGCCTTAATGCG

The PCR cycling conditions were: 

1 cycle     2 min. at 94°
30 cycles   30 sec. at 94°
            30 sec. at ~54° or ~60° (according to the Tm of the primers)
            40 sec. at 72°
1 cycle     7 min. at 72°
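
As a minimal sketch, the cycling conditions above can be written as a data structure, which makes the total programmed hold time easy to compute. The class and function names are hypothetical, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    seconds: int
    temp_c: int


def cycling_program(anneal_c: int):
    """Return the PCR program as (number of cycles, steps per cycle) pairs.

    `anneal_c` is the annealing temperature (about 54 or 60, depending on
    the Tm of the primers).
    """
    return [
        (1, [Step(120, 94)]),                                      # initial denaturation
        (30, [Step(30, 94), Step(30, anneal_c), Step(40, 72)]),    # amplification
        (1, [Step(420, 72)]),                                      # final extension
    ]


def hold_time_seconds(program) -> int:
    """Sum of programmed hold times, excluding temperature ramps."""
    return sum(cycles * sum(step.seconds for step in steps) for cycles, steps in program)


total = hold_time_seconds(cycling_program(54))
print(total, "seconds =", total // 60, "minutes")
```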

The PCR products were purified from 1% agarose gel and sequenced using the
following primers: 

orf 4.1 (forward)   CGAATCCGGACGGCAGGACTC
orf 4.2 (forward)   CGACCGCGCCTTTGGGACTG
orf 4.3 (reverse)   GGCAGGGAATGGCGGATTAAAG
orf 4.4 (reverse)   TCTTTGAGTTTGATCCAACC
919.1 (forward)     AAAATGCCTCTCCACGGCTG or
                    CTGCGCCCTGTGTTAAAATCCCCT
919.2 (forward)     ATCCTTCCGCCTCGGCTGCG
919.3 (forward)     AAAACAGCGGCACAATCGAC
919.4 (forward)     ATAAGGGCTACCTCAAACTC
919.5 (forward)     GCGCGTGGATTATTTTTGGG
919.6 (reverse)     CAAATAAGAAAGGAATTTTG or
                    GGTATCGCAAAACTTCGCCTTAATGCG
919.7 (reverse)     CCCAAGGTAATGTAGTGCCG
919.8 (reverse)     TAAAAAAAAGTTCGACAGGG
919.9 (reverse)     CCGTCCGCCTGTCGTCGCCC
919.10 (reverse)    TCGTTCCGGCGGGGTCGGGG

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention.

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10

ORF   Primer    Sequence                              SEQ ID   Restriction sites

279   Forward   CGCGGATCCCATATG-TTG[...]              3021     BamHI-NdeI
      Reverse   CCCGCTCGAG-TTTAGAAGCGGGCGGCAA         3022     XhoI

519   Forward   CGCGGATCCCATATG-TTCAAAT[...]          3023     BamHI-NdeI
      Reverse   CCCGCTCGAG-TTTGGCGGTTTTGCTGC          3024     XhoI

576   Forward   CGCGGATCCCATATG-GCCG[...]             3025     BamHI-NdeI
      Reverse   CCCGCTCGAG-ATTTACTTTTTT[...]          3026     XhoI

919   Forward   CGCGGATCCCATATG-TG[...]               3027     BamHI-NdeI
      Reverse   CCCGCTCGAG-CGGGCGGTATTCGGG            3028     XhoI

121   Forward   CGCGGATCCCATATG-GAAA[...]             3029     BamHI-NdeI
      Reverse   CCCGCTCGAG-ATAATAATATCCCGCGCCC        3030     XhoI

128   Forward   CGCGGATCCCATATG-ACTGACAACGCACT        3031     BamHI-NdeI
      Reverse   CCCGCTCGAG-GACCGCGTTGTCGAAA           3032     XhoI

206   Forward   CGCGGATCCCATATG-AAACACCGCCAACCGA      3033     BamHI-NdeI
      Reverse   CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC      3034     XhoI

287   Forward   CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG     3035     EcoRI-NheI
      Reverse   CCCGCTCGAG-ATCCTGCTCTTTTTTGCC         3036     XhoI

406   Forward   CGCGGATCCCATATG-TGCGGGACACTGACAG      3037     BamHI-NdeI
      Reverse   CCCGCTCGAG-AGGTTGTCCTTGTCTATG         3038     XhoI


Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, "m"
means a sequence from N. meningitidis B, and "a" means a sequence from N. meningitidis A;
"#" means the number of the sequence; "seq" means a DNA sequence, and "pep" means an
amino acid sequence. For example, "g001.seq" refers to an N. gonorrhoeae DNA sequence,
number 1. The presence of the suffix "-1" to these sequences indicates an additional sequence
found for the same ORF; thus, data for an ORF having both an unsuffixed and a suffixed
sequence designation applies to both such designated sequences. Further, open reading
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to
the number of the sequence which encodes the ORF, and the ORF designations may be
suffixed with ".ng" or ".a", indicating that the ORF corresponds to a N. gonorrhoeae sequence
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates
that the sequence may be a partial or a complete ORF. Computer analysis was performed for
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between
the text immediately preceding and describing which sequences are being compared, and the
designated sequences being compared, the designated sequence controls and is the actual
sequence being compared.
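
The title convention described above can be sketched as a small parser. The regular expression, the function name and the returned field names are hypothetical; the text only mentions the suffix "-1", so accepting any numeric suffix is an assumption.

```python
import re

# [g, m, or a] [#] optional "-n" suffix . [seq or pep]
TITLE = re.compile(r"^(?P<source>[gma])(?P<number>\d+)(?P<suffix>-\d+)?\.(?P<kind>seq|pep)$")

SOURCE = {
    "g": "N. gonorrhoeae",
    "m": "N. meningitidis B",
    "a": "N. meningitidis A",
}
KIND = {"seq": "DNA", "pep": "amino acid"}


def parse_title(title: str) -> dict:
    """Split a sequence title such as 'g001.seq' into its components."""
    match = TITLE.match(title.strip())
    if match is None:
        raise ValueError(f"not a recognised sequence title: {title!r}")
    return {
        "source": SOURCE[match["source"]],
        "number": int(match["number"]),
        "additional_sequence": match["suffix"] is not None,
        "type": KIND[match["kind"]],
    }


print(parse_title("g001.seq"))
print(parse_title("m519-1.seq"))
```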



ORF: contig: 

279 gnm4.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3039>:
m279.seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 

51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 

101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 

151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 

201 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 

251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGCACGAG

351 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGGCAACG AGTGCGGTGT 

401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

This corresponds to the amino acid sequence <SEQ ID 3040; ORF 279>:
m279.pep

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM
51 ARPTAAALPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP
101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA
151 SK*



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3041>:
g279.seq

1 [..........] tttgcggctg cttgatttca acggttttga gtgtttcggc

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata

101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga

201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa

251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct

301 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag

351 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt

401 [..........] [..........] [....]cggcgg caggcgtttt gccgcccact

451 [..........]

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279.ng>:

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE [..........] LAPASLAAAM
51 [.....]AALPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT
151 SK*

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng)
from N. gonorrhoeae:



10 20 30 



m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA

: M I I I I I I I h : I I I I I I || M M I I I I I I 1 1 I I I I I I m mimiiiTTiTn 

10 20 30 40 50 60

70 80 90 100 110 120

m279.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA

II IMMIIIlllll I llh ilMMI I- IIMIIMII INI Ml MIIIMI

g279 ITTCPGELKLTASTTSPCADSAQICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA

70 80 90 100 110 120

130 140 150

m279.pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX

III I I ' I I M M I M MM MMMM : M I

g279 SAKSNASAATSAVYSPRLCPATAAGVLPPTSKX

130 140 150

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3043>: 

a279.seq

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCCGAATAG

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a279.pep

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 ORFs 279 and 279.a showed an 88.2% identity in 152 aa overlap

10 20 30 40 50 60

m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA

: I MINIMI M I I M : i I I M M I I I 1 I I I | | I I I I I M : : I I I M M I I

a279 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA

10 20 30 40 50 60

70 80 90 100 110 120

m279.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA

> > I I I I I M I I I I I I I Ml: : I I I I I I I ! I I M I | M M I Ml I I ! I I |

a279 ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCISSARXRTSLTA

70 80 90 100 110 120

130 140 150

m279.pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX

Ml M I I I I I I II M I II I I M I II I I M : I

a279 SAKSNAPAATSAVYSPXLCPATAAGVLPPASEX

130 140 150
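
A percent identity figure of the kind quoted for these comparisons can be computed from two aligned sequences by counting matching columns. The sketch below is a simplification: it counts every aligned column, including those containing X, which may differ from how the alignment program used for the figures above treated such positions. The function name is hypothetical, and the example uses only the last block of the m279/a279 alignment (residues 121 onwards).

```python
def percent_identity(seq_a: str, seq_b: str):
    """Return (identical columns, percent identity) for two aligned sequences.

    Both sequences must already be aligned and therefore of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    identical = sum(a == b for a, b in zip(seq_a, seq_b))
    return identical, 100.0 * identical / len(seq_a)


# Last block of the m279/a279 alignment.
m279_tail = "SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX"
a279_tail = "SAKSNAPAATSAVYSPXLCPATAAGVLPPASEX"

identical, pct = percent_identity(m279_tail, a279_tail)
print(f"{identical}/{len(m279_tail)} identical ({pct:.1f}%)")
```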



519 and 519-1 gnm7.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3045>: 

m519.seq (partial) 

1 ..TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGGTTCC GCCGCAAGAA

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA

251 GTGGTCAGCG CGAAGCCGAA ATCCAACAAT CCGAAGGCGA GGCTCAGGCT

301 GCGGTCAATG CGTCAAATGC CGAGAAAATC GCCCGCATCA ACCGCGCCAA

351 AGGTGAAGCG GAATCCTTGC GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA

401 TCCGTCAAAT TGCCGCCGCC CTTCAAACCC AAGGCGGTGC GGATGCGGTC

451 AATCTGAAGA TTGCGGAACA ATACGTCGCT GCGTTCAACA ATCTTGCCAA

501 AGAAAGCAAT ACGCTGATTA TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC

551 TGATTTCTGC CGGTATGAAA ATTATCGACA GCAGCAAAAC CGCCAAaTAA

This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>:

m519.pep (partial)

1 ..SVIGRMELDK TFEERDEINS TVVAALDEAA GAWGVKVLRY EIKDLVPPQE

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK*
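
The correspondence between a DNA listing and its peptide can be checked by translating the DNA with the standard genetic code. The sketch below translates the first 90 nucleotides of m519.seq as listed above. The function name is hypothetical, and codons containing anything other than A, C, G or T are reported as X, which is a choice made for this sketch.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 90 nucleotides of m519.seq (partial), copied block by block.
m519_start = (
    "TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA "
    "AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC"
)
print(translate(m519_start))
```

The printed residues can be compared with the start of m519.pep.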

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>:

g519.seq

1 atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaaggctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

301 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

351 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

401 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt

451 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 

501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 

551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa ccttgccagt 

601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 

651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 

701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 



751 [..........] [..........] tcaaacccaa agcggggcgg atgcggtcaa

801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag

851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct

901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>:

g519.pep

1 MEFFIILLAA VAVFGFKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVS ALDEAAGAWG

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAN

251 RQIAAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP

301 NFRRHEKFSP EAKTAK*

ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng)
from N. gonorrhoeae:

m519/g519 



10 20 30

m519.pep SVIGRMELDKTFEERDEINSTVVAALDEAA

90 100 110 120 130 140

40 50 60 70 80 90

m519.pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE

MIMMI Ml IMIIIMMI I II I II II III I i IIMIMI II MM MM!

g519 GAWGVKVLRYEIKDLVPPQEILRAMQAQITAEREKRARIAESEGRKIEQINLASGQREAE

150 160 170 180 190 200

100 110 120 130 140 150

m519.pep IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV

IMIMIIMM IMMMMIMMiM III I II Ml M Ml MMIIMMMMM

g519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEANRQIAAALQTQSGADAV

210 220 230 240 250 260

160 170 180 190 200

m519.pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL-ISAGMKIIDSSKTAK

MM! M M I M M M M M I MIMIm : \ MIM

g519 NLKIAGQYVTAFKNLAKEDNTRIKPAKVAEIGNPNFRRHEKFSPEAKTAK

270 280 290 300 310

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3049>:

a519.seq

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORF 519.a>: 

a519 . pep 

1 MEFFIILLAA VWFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 
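
The correspondence between a DNA listing and its amino acid sequence can be checked by conceptual translation. The sketch below assumes the standard codon assignments (which the bacterial genetic code shares for elongation) and a reading frame that starts at position 1; `translate` is a hypothetical helper, not a routine described in this specification.

```python
from itertools import product

# Standard codon table built in TCAG order: first base varies slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X', stops '*'."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 60 nucleotides of a519.seq and the last 18 nucleotides.
start = translate(
    "ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA ATCCTTTGTT"
)
end = translate("AGCAAAACCG CCAAATAA")
print(start)  # MEFFIILLAAVVVFGFKSFV
print(end)    # SKTAK*
```

The translated prefix and suffix agree with the first two blocks and the final residues of a519.pep.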

m519/a519 ORFs 519 and 519.a showed a 99.5% identity in 199 aa overlap

10 20 30 

m519.pep                               SVIGRMELDKTFEERDEINSTVVAALDEAA
                                       |||||||||||||||||||||||:||||||
a519     YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTVVSALDEAA
90 100 110 120 130 140 

40 50 60 70 80 90 

m519.pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a519     GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE
150 160 170 180 190 200

100 110 120 130 140 150

m519.pep IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a519     IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV
210 220 230 240 250 260

160 170 180 190 200 

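m519.pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX
         ||||||||||||||||||||||||||||||||||||||||||||||||||
a519     NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX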

270 280 290 300 310 

Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3051>:

m519-1.seq
1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 
51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 
101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT G^CcS^ 
151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC cSSSS 
201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTG^GaS 
251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 
301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 
351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

r^rrltZ TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGC^GGG^ 
451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 
501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGcSctc 
551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

rflr^rnr M6CC,MM CCAACAATCC GAAGGCGAGG C^IggSgJ 
701 rvr-^ln^ TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 
701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 
751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGctSS 

«m I C I^ GATT GCGGAACAAT acgtcgctgc gttcaacaat ct?SccISg* 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>:

m519-1.pep

1 MEFFIILLVA VAVFGFKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVA ALDEAAGAWG

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL

301 ISAGMKIIDS SKTAK*

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3053>:

g519-1.seq

1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

?S ^ TTTCCA TCGCGCCC ^ ACGGCCGGTT TGAATAtS S3!SS 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG 

251 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

?M ^^lrn A TTATGGCAAT TACCCAGCTT GCCCAAACGA CGC^gSc 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

11, r^ CACTA ° CGTCGTC TCC GCCCTCGATG AAGCCGCCGG GGCtSJ 
50 I™ AC . GA AMCAAGGAT TTGGTTCCGC CGCmSE 



III S£££ ™~ 2-™ ™E 

601 
651 
701 

80S TCTGflAGATT GCSS A»C«T iSSSSS SSGZ£S SSSSS 



.„ 252S5 S252S5 ™zi ™SI 

651 

S -™ ™5 ~2 SSEK? 



ggtcagcgtg aagccgaaat ccaa^c g^gag^ ctcAggctg? 

GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC SSSSS 
^2^» GGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC" 
851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-1.ng>:

g519-1.pep

1 MEFFIILLAA VAVFGFKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVS ALDEAAGAWG

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



m519-1/g519-1 ORFs 519-1 and 519-1.ng showed a 99.0% identity in 315 aa overlap

10 20 30 40 50 60 

g519-1.pep MEFFIILLAAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL
i I ! fM M:M M ! tl II I M ] II IN i I Ml i M I I ! ! I I I I i II II | I! M I i| I I I 
m519-1 MEFFIILLVAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL

10 20 30 40 50 60 

70 80 90 100 110 120

g519-1.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
I M I I I M M M i I t I I If I I II I I I I I I I M I I I I i I I M I | | | | | | M II I! I I I I II 
m519-1 KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG

70 80 90 100 110 120

130 140 150 160 170 180 

g519-1.pep RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE
f I 1 I I I I I I I I I I I I I | | | : | | M I I I I I II I I I I I I I I I I | II I I I I | : I | | | | f | | | | 
m519-1 RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
130 140 150 160 170 180



190 200 210 220 230 240 

g519-1.pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
> I I I I I I I I M I I I I I M I I I I I 1 I I I j | | | | | | | I I I I i I I I i | | I I M I I I I I j I I I I 
m519-1 KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR

190 200 210 220 230 240 

250 260 270 280 290 300 

g519-1.pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL

N I I I I M I I I I I I I I I I II I I 11 I M I I I I I I I I I | I I I I I I M I I I I 1 I I I | | 

m519-1 LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
250 260 270 280 290 300



310 

g519-1.pep ISAGMKIIDSSKTAKX
           ||||||||||||||||
m519-1     ISAGMKIIDSSKTAKX

310 
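
The identity figure quoted for an alignment is the number of identical aligned positions divided by the overlap length. The sketch below recomputes it for residues 121 to 180 of the g519-1/m519-1 alignment above. The helper `percent_identity` is hypothetical, and the residue run "VV" is written out on the assumption that a scanned "W" in these positions stands for two valines, which is what the 60-residue line length requires.

```python
def percent_identity(seq_a, seq_b):
    """Return (matches, percent) for two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    # Gap characters never count as identities.
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    return matches, 100.0 * matches / len(seq_a)


# Residues 121-180 of ORF 519-1.ng and ORF 519-1.
g519_1 = "RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE"
m519_1 = "RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE"

matches, percent = percent_identity(g519_1, m519_1)
print(matches, round(percent, 1))

# Three mismatches over the full 315-residue overlap (one in residues
# 1-60, two in residues 121-180) give the overall figure.
overall = round(100.0 * (315 - 3) / 315, 1)
print(overall)
```

Counting one further mismatch in the first 60 residues, three mismatches over 315 positions give 99.0%, consistent with the value stated for this pair.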

The following DNA sequence was identified in N. meningitidis <SEQ ID 3055>: 

a519-1.seq

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 
401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT
451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC TCoSSSSS 
501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCr 
551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA Sc^JgCcSgt 

601 ggtcagcgcg aagccgaaat ccaacaatcc gaagg?gag^ ctcIggSgc 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC SSSSS 
701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGcSS? 

^AAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG MG^GgSa^ 
801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGOPAaar 
So" A^cS SSSSS CCCGCCAATG TtSSSJ CGGcSSg 

901 ATTTCTGCCG gtatgaaaat tatcgacagc agcaaaaccg ccaaataa 
This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-1.a>:

a519-1.pep

1 MEFFIILLAA VVVFGFKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF
51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS
101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVS ALDEAAGAWG
151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS
201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI
251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL
301 ISAGMKIIDS SKTAK*

a519-1/m519-1 ORFs 519-1 and 519-1.a showed a 99.0% identity in 315 aa overlap

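10 20 30 40 50 60

a519-1.pep MEFFIILLAAVVVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL
m519-1     MEFFIILLVAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL

70 80 90 100 110 120

a519-1.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
m519-1     KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG

130 140 150 160 170 180

a519-1.pep RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
m519-1     RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE

190 200 210 220 230 240

a519-1.pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
m519-1     KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR

250 260 270 280 290 300

a519-1.pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
m519-1     LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL

310

a519-1.pep ISAGMKIIDSSKTAKX
m519-1     ISAGMKIIDSSKTAKX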
576 and 576-1 gnm22.seq



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3057>: 

m576.seq.. (partial) 

1 . .ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

51 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

101 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

151 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

201 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

251 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

301 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

351 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

401 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA

451 GTGATTCCGG GTTGGACCGA AGgCGTACAG CTTCTGAAAG AAGGCGGCGA 

501 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

551 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

601 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

651 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

m576.pep.. (partial)

1 . .MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 
51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

201 KIGAPENAPA KQPAQVDIKK VN*

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576.seq.. (partial)

1 . .atgggcgtgg acatcggacg ctccctgaaa caaatgaagg aacagggcgc 

51 ggaaatcgat ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 

101 gcaaagaaat caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 

151 ttcctgcagg agcagcaggc taaagccgta gaaaaacaca aggcggatgc 

201 gaaggccaac aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 

251 aagacggcgt gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 

301 cagggtgaag gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 

351 cgaaggccgc ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 

401 gcggcccggc caccttccct ttgagccaag tgattccggg ttggaccgaa 

451 ggcgtacggc ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 

501 caaccttgcc taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 

551 ccactttggt atttgacgtg aaactggtca aaatcggcgc acccgaaaac 

601 gcgcccgcca agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g576.pep.. (partial)

1 ..MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 
51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 

101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 

151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 

201 APAKQPDQVD IKKVN*



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576.ng showed a 97.2% identity in 215 aa overlap

10 20 30 40 50 60 

m576.pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQ

I'll I II I M I I I I I I I I I I : I I I | I | M I I I I I | i | | | M | | | | j M 
g576 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQ

10 20 30 40 50 

70 80 90 100 110 120

m576.pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV
g576     EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV

60 70 80 90 100 110



130 140 150 160 170 180

m576.pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYRE

1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 ' 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • i , i iiimimm mm 

g576 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYRE
120 130 140 150 160 170 



190 200 210 220

m576.pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX
IIMrlllllllllMlllllllllllllllll | | | | ] , | , , 

g576 QGAGEKIGPNATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX
180 190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3061>:

a576.seq

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCtJS 

201 GCAAATGAAG gaacagggcg cggaaatcga tttgaaagtc tttaccgaag 

251 ccatgcaggc agtgtatgac ggcaaagaaa tcaaaatgac cgaagagcag

301 gctcaggaag tcatgatgaa attccttcag gaacaacagg ctaaagccgt 

351 agaaaaacac aaggcggacg cgaaggccaa taaagaaaaa ggcgaagcct 

IV: IZ™?* G * ^tgccgcc aaagacggcg tgaagaccac tgcttccggc 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

ISi ar™^ GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCgI 

5m rrr*r*tZ15 ™ CATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG C^G^gS 

801 CATCAAAAAA GTAAATTAA

This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576.a>:

a576.pep

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV
251 KIGAPENAPA KQPAQVDIKK VN*

m576/a576 ORFs 576 and 576.a showed a 99.5% identity in 222 aa overlap

10 20 30

m576.pep                               MQQASYAMGVDIGRSLKQMKEQGAEIDLKV
                                       ||||||||||||||||||||||||||||||
a576     CGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGVDIGRSLKQMKEQGAEIDLKV

30 40 50 60 70 80

40 50 60 70 80

m576.pep FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a576     FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA

90 100 110 120 130 140
100 110 120 130 140 150

m576.pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a576     KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ

150 160 170 180 190 200 

160 170 180 190 200 210 

m576.pep VIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA

M I I I I I I I I I I I I I I I I t I I I I M I I I I I I I I M II II I I I I I I I I I i I I I I I I I I 1 I 
a576 VILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 

220 

m576.pep KQPAQVDIKKVNX
         |||||||||||||
a576     KQPAQVDIKKVNX

270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m576-1.seq

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-1>:

m576-1.pep

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN*
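
The relationship between the partial ORF 576 and the full-length ORF 576-1 can be illustrated by locating the start of the partial sequence within the longer one. The sketch below is only an illustration; `locate_fragment` is a hypothetical helper and the strings are the opening residues of m576-1.pep and m576.pep as printed in this section.

```python
def locate_fragment(full, fragment):
    """Return the 1-based position of fragment in full, or None if absent."""
    index = full.find(fragment)
    return None if index < 0 else index + 1


# First 70 residues of m576-1.pep and first 20 residues of m576.pep.
m576_1_start = (
    "MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGST"
    "MQQASYAMGVDIGRSLKQMK"
)
m576_start = "MQQASYAMGVDIGRSLKQMK"

position = locate_fragment(m576_1_start, m576_start)
remaining = 272 - position + 1  # residues of ORF 576-1 from that point on
print(position, remaining)
```

On this reading the partial sequence begins at residue 51 of ORF 576-1, leaving 222 residues, which matches the length of the partial m576.pep listing.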

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-1.seq

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

401 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT

451 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
701 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTr

Si SSS CGCGCCCGCC MTOG 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-1.ng>:

g576-1.pep

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASAA QGDTSSIGST
51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ
101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG
151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV
251 KIGAPENAPA KQPDQVDIKK VN*

m576-1/g576-1 ORFs 576-1 and 576-1.ng showed a 97.8% identity in 272 aa overlap

10 20 30 40 50 60

g576-1.pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASAAQGDTSSIGSTMQQASYAMGV
m576-1     MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV

70 80 90 100 110 120

g576-1.pep DIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH
m576-1     DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH

130 140 150 160 170 180

g576-1.pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID
m576-1     KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID

190 200 210 220 230 240

g576-1.pep GTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYREQGAGEKIGPN
m576-1     GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN

250 260 270

g576-1.pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX
m576-1     ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX



The following DNA sequence was identified in N. meningitidis <SEQ ID 3067>:

a576-1.seq



1 ATGAACACCA
51 ACTTTCCGCC
101 CTGCCGCCGC
151 ATGCAGCAGG
201 GCAAATGAAG
251 CCATGCAGGC
301 GCTCAGGAAG
351 AGAAAAACAC
401 TTCTGAAAGA
451 CTGCAATACA
501 CGACATCGTT
551 TCGACAGCAG
601 GTGATTCTGG
651 AGCCACGTTC
701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-1.a>:

a576-1.pep

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-1/m576-1 ORFs 576-1 and 576-1.a showed a 99.6% identity in 272 aa overlap



10 20 30 40 50 60 

a576-1.pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV

1 ' I M I M j I I I I j M I I I I I I I I I I I I i f M j } I I j I 1 M | ! E i I I I j I ! I | | I | | } M 
m576-1 MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV

10 20 30 40 50 60 

70 80 90 100 110 120 

a576-1.pep DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH
I I I I I I I M I I M i I I II I I I II f i I I I I f I j [ I I | N ! I I I I I I I I | | | | | | | | ) | | M 
m576-1 DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH
70 80 90 100 110 120



130 140 150 160 170 180 

a576-1.pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m576-1     KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID

130 140 150 160 170 180

190 200 210 220 230 240 

a576-1.pep GTVFDSSKANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN

I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | M M I I I I I t I I | I || | Ml 

m576-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

a576-1.pep ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX
           |||||||||||||||||||||||||||||||||
m576-1     ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX

250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3069>: 

m919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG

401 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG
451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT
501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA
551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA
601 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT
651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC
701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC
751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT
801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG
851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC
901 AAACTCGGAC AAACCTCCAT GCAGGGCATT AAGTCTTATA TGCGGCAAAA
951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT
1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 
1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 
1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 
1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 
1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 
1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 
1301 GTATGAAGCC CGAATACCGc CCGTAA



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 919>:

m919.pep

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA

51 GTTVGGGGAV YTVVPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA
251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL

301 KLGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG
401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P*

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3071>:

g919.seq

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCqccAT 
51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 
101 CATCCGTCAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 



201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 
^^ CGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTT CAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 

351 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 

401 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGGCAGG 

451 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 



551 TCAGGCAGac ggGGAAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG
601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat
651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC
701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC
751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT

851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 
o» ^ tC999C a 9 ACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 
951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 
1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 



1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACcJgSSg 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC SSSSc 

III, JEST**** TGGA " ATTT TTGGGGTTAC GGCGACGAAG SSSSES 

1251 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

1301 GCATGAAGCC CGAATACCGC CCGTGA

This corresponds to the amino acid sequence <SEQ ID 3072; ORF 919.ng>:

g919.pep

1 MKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 

51 GTTVAGGGAV YTVVPHLSMP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV

101 CAQAFQTPVH SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARFPIY GIPDDFISVP LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS GNEGPVGALG 

351 TPLMGEYAGA IDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9% identity over a 441 aa overlap with a predicted ORF (ORF 919.ng)
from N. gonorrhoeae:

m919/g919 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV

III : IHHIIIII|||||||||:||||||||||||MIIII|:|I1IMMI|:|IMI 
g919 MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m919.pep YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER

I 1 1 1 1 M I : II I ! 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 M I M I I I I I 1 : 1 1 1 1 

g919 YTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER

70 80 90 100 110 120 

130 140 150 160 170 180 

m919.pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA

M I Ml I! I II M I 1 1 I I I I ! I I M ; I i mi M M M Ml M 1 1 M M I M II M M 

g919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 

130 140 150 160 170 180 

190 200' 210 220 230 240 

m919.pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL

I M I : I M M h 1 1 M I IM II I !M I M 1 1 1 I 1 1 1 1 II 1 Ml II 1 1 1 i 1 1 1 1 i 

g919 LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL

190 200 210 220 230 240 

250 260 270 280 290 300 

m919.pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

IMMMI MM I M II III I MMIM MMMIM I MMI IMIIMM II IM I II 

g919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL

250 260 270 280 290 300 

310 320 330 340 350 360 

m919.pep KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

I IM IMM M M I I M I I I I II M !! MM II I II M I M i-- II MUM 

g919 KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA

310 320 330 340 350 360

370 380 390 400 410 420 

m919.pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK

= I ■ I 1 1 1 1 1 [ 1 1 1 I M ! i i I 1 1 1 1 ! 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 | M I r 1 1 1 1 1 1 i 1 1 1 I i I I ! 

g919 IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK

370 380 390 400 410 420 
430 440
m919.pep QKTTGYVWQLLPNGMKPEYRPX

H M 1 1 1 1 1 II II I II 1 1 M II 

g919 QKTTGYVWQLLPNGMKPEYRPX

430 440 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3073>:

a919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTT TATACCGTTG TGCCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAGCCGG TGCTGAAGGG CGACGACAGG

451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC

751 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCGGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG

1301 GTATGAAGCC CGAATACCGC CCGTAA

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>:

a919.pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA

51 GTTVGGGGAV YTVVPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQNPQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919.a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV

1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | , | I , , 

a919 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 20 30 40 50 60 

m919.pep YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER

1 1 1 1 I " I I I I I I I I I I I I I I I I I I I i I I I I I 1 I I I I | | | | || || II [|| I | | II I I M 
a919 YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

70 80 90 100 110 120

130 140 150 160 170 180

m919.pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
> I I M I i I I I I I I I I i I I 1 I I I I I I j M I I I I I I I I I I I | | | | I | | | | M M | j | | | | | | 
a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

130 140 150 160 170 180

                 190       200       210       220       230       240
m919.pep  LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL
          |||||||||||||||||||||||||| |||||||||||||||||||||||||||||||||
a919      LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m919.pep  DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a919      DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL
                 250       260       270       280       290       300

                 310       320       330       340       350       360
m919.pep  KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA
          ||||||||||| || |||||||||||||||||||||| ||||||||||||||||||||||
a919      KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA
                 310       320       330       340       350       360

                 370       380       390       400       410       420
m919.pep  VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a919      VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK
                 370       380       390       400       410       420

                 430       440
m919.pep  QKTTGYVWQLLPNGMKPEYRPX
          ||||||||||||||||||||||
a919      QKTTGYVWQLLPNGMKPEYRPX
                 430       440
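The percent identity quoted for each alignment can be checked by counting identical residues column by column. The following is a minimal sketch only: the specification does not state which alignment program or scoring scheme was used, the function names `percent_identity` and `mismatch_positions` are hypothetical, and the inputs are assumed to be two already-aligned, gapless rows of equal length, here the first 60 residues of m919 and a919.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Return the percentage of aligned columns that hold the same residue.

    Assumes both rows are already aligned and of equal length (no gap handling).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have equal length")
    if not seq_a:
        raise ValueError("empty alignment")
    matches = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
    return 100.0 * matches / len(seq_a)


def mismatch_positions(seq_a: str, seq_b: str) -> list[int]:
    """Return 1-based column numbers at which the two rows differ."""
    return [i for i, (x, y) in enumerate(zip(seq_a, seq_b), start=1) if x != y]


# Residues 1-60 of ORF 919 (m919) and ORF 919.a (a919), taken from the alignment above.
m919_block = "MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV"
a919_block = "MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV"

if __name__ == "__main__":
    print(round(percent_identity(m919_block, a919_block), 1))
    print(mismatch_positions(m919_block, a919_block))
```

Applied to full-length rows, the same count gives the overall figure for an ORF pair; positions marked X (unresolved residues) simply count as mismatches against any determined residue under this simple rule.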



121 and 121-1 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3075>: 

m121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 121>:

m121.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 
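The correspondence between a DNA listing and its amino acid listing can be reproduced by reading the DNA in frame, three bases at a time. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative only. Any codon containing an undetermined base (`n` or `x` in the listings) is reported as `X`, and a stop codon as `*`, matching the notation used in the peptide listings.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA listing (whitespace and case ignored) in reading frame 1.

    Codons with any base other than A, C, G or T become 'X'; stop codons become '*'.
    A trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of m121.seq, then the final 51 bases (positions 1051-1101).
    print(translate("ATGGAAACAC AGCTTTACAT CGGCATCATG"))
    print(translate("GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG A"))
```

The two example fragments are in frame because they start at positions 1 and 1051, each one base past a multiple of three; a fragment starting elsewhere would first need trimming to a codon boundary.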

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>:

g121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

401 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT

451 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121.ng>:

g121.pep

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF 121.ng)
from N. gonorrhoeae:

m121/g121

                  10        20        30        40        50        60
m121.pep  METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL
          |||||||||||||||||||||| ||||||||||||||||||| |||| |||||||| |||
g121      METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m121.pep  HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL
          |||| |||||||||||||||||||||||| ||||||||||||||||||||||||||||||
g121      HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL
                  70        80        90       100       110       120

                 130       140       150       160       170       180
m121.pep  AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
          |
g121      AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGIANISVLPPGA
                 130       140       150       160       170       180

                 190       200       210       220       230       240
m121.pep  XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST
                                |||||||||| |||||||||| |||||||| | |||||
g121      PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m121.pep  GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV
          |||||| |||||||||||||||||||||||||||| |||||||||||||||| |||||||
g121      GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV
                 250       260       270       280       290       300

                 310       320       330       340       350       360
m121.pep  LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL
          ||||||||||||||||||| ||||||||||| ||||||||||||||||||||||||||||
g121      LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL
                 310       320       330       340       350       360

m121.pep  XAGYYYX
           ||||||
g121      GAGYYYX



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3079>: 



a121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG
51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG
101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG
151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC
201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA
251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA
301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT
351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA
401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT
451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT
501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG
551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA
601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA
651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC
701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC
751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT
801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG
851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG
951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG
1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA
1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG
1101 A



This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121.a>: 

a121.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL
51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ
101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF
151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA
201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL
251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV
301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK
351 ATGASKPCIL GAGYYY*

m121/a121 ORFs 121 and 121.a showed a 74.0% identity in 366 aa overlap

                  10        20        30        40        50        60
m121.pep  METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL
          ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
a121      METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m121.pep  HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL
          |||| ||||||||||||||||||||||||||||||||||||||||||| || ||||||||
a121      HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL
                  70        80        90       100       110       120

                 130       140       150       160       170       180
m121.pep  AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
          |
a121      AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA
                 130       140       150       160       170       180

                 190       200       210       220       230       240
m121.pep  XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST
                                |||||||||| ||||||||||||||||||||| |||||
a121      PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m121.pep  GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV
          |||||| |||||||||||||||||||||||||||| |||||||||||||||| |||||||
a121      GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV
                 250       260       270       280       290       300

                 310       320       330       340       350       360
m121.pep  LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL
          ||||||||||||||||||| ||||||||||| ||| |||| |||||||||||||||||||
a121      LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL
                 310       320       330       340       350       360

m121.pep  XAGYYYX
           ||||||
a121      GAGYYYX

Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3081>:

m121-1.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG
851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3082; ORF 121-1>: 

m121-1.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK

351 ATGASKPCIL XAGYYY*

m121-1/g121 ORFs 121-1 and 121.ng showed a 95.6% identity in 366 aa overlap

                    10        20        30        40        50        60
m121-1.pep  METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL
            |||||||||||||||||||||| ||||||||||||||||||| |||| |||||||| |||
g121        METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m121-1.pep  HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL
            |||| |||||||||||||||||||||||| ||||||||||||||||||||||||||||||
g121        HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m121-1.pep  AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA
            || |||||||||||||||||||||||||||||||||| |||| ||||||||||||||| |
g121        AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGIANISVLPPGA
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m121-1.pep  PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST
            ||||||||||||||||||||||||||||||||||||||||||| |||||||| |||||||
g121        PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m121-1.pep  GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV
            ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
g121        GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV
                   250       260       270       280       290       300

                   310       320       330       340       350       360
m121-1.pep  LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL
            ||||||||||||||||||| ||||||||||| ||||||||||||||||||||||||||||
g121        LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL
                   310       320       330       340       350       360

m121-1.pep  XAGYYYX
             ||||||
g121        GAGYYYX

The following DNA sequence was identified in N. meningitidis <SEQ ID 3083>: 

a121-1.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-1.a>:

a121-1.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK

351 ATGASKPCIL GAGYYY*

m121-1/a121-1 ORFs 121-1 and 121-1.a showed a 96.4% identity in 366 aa overlap

                    10        20        30        40        50        60
m121-1.pep  METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL
            ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
a121-1      METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m121-1.pep  HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL
            |||| ||||||||||||||||||||||||||||||||||||||||||| || ||||||||
a121-1      HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m121-1.pep  AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA
            |||| |||||||||||||||||||||||||||||||| ||||||||||||||||||||||
a121-1      AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m121-1.pep  PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST
            ||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
a121-1      PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m121-1.pep  GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV
            ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
a121-1      GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV
                   250       260       270       280       290       300

                   310       320       330       340       350       360
m121-1.pep  LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL
            ||||||||||||||||||| ||||||||||| ||| |||| |||||||||||||||||||
a121-1      LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL
                   310       320       330       340       350       360

m121-1.pep  XAGYYYX
             ||||||
a121-1      GAGYYYX



128 and 128-1 



m128.seq (partial)

1 ATGACTGACA
51 AATCAAAACC
101 CGCGCGAACA
151 AACACTGTCG
201 GGGCGTGGTG
251 CCGTCTATAA
301 GGACAAGACA
351 CGAATTCGAC

1 TACGCCAGCG
51 wGTCAAAAAA
101 AAmTCAAAAA
151 TGGCACAAAG
201 AGGCGGCGTT
251 CGTGGATGAA
301 CAAyTGCCCA
351 CAGGGAAGCC
401 CCGGACACGG
451 TCCGGCATCA
501 TATGGAAAAT
551 ACGAAGAAAC
601 GCCGCCAAAA
651 CGCCCTCTTT
701 AAAACTGGCA
751 CAGCCGCCCG
801 AGGCGGCTAT
851 GCGCGGACGC
901 GGCAAACGCT
951 nGCAGAATCC
1001 TCTTGCGCCA




This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

m128.pep (partial)

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA

51 NTVEPLTGIT ERVGRIWGVV SHLNCVADTP ELRAVYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3087>:

g128.seq

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 
51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 
101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 
151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG
201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 
251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 
351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC
401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 
1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 
1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 
1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 
1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 
1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA
1401 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG
1451 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 
1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 
1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>:

g128.pep

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA

51 NTVERLTGIT ERVGRIWGVV SHLNSVVDTP ELRAVYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ

501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR QSGFDNAA*



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

m128/g128

10 20 30 40 50 60 

gl28 .pep MIDNALLHLGEEPRFNQIQTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I IMIMIIIIMhlhMMIIhllllllll MlhlMIIIIMIIII MM 

MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 



ml28 



7 ° 60 90 100 110 120 

gl28.pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

NIMIIIIMIII hlllllMlillllMIIIMMIMIIIIIIIlllllllMi 

ml 2 8 ERVGRI WGWSHLNCVADTPELRAVYNELMPE I TVFFTE IGQD I ELYNRFKTI KNS PEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

gl28 . pep TLS PAQ KTKLDHDLRDFVLS GAE L P PERQAEIiAKLQTEGAQLS AKFSONVLDATDAFG I Y 

lllllllllhl 

ml 2 8 TliSPAQKTKLNH 

130 

// 

340 350 360 

g!28 . pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

IMIIIIIIIIIMI IIIIMII II I 

YASEKLREAKYAFSETXVKKYFPVGXVLNG 
10 20 30 



ml28 



370 380 390 400 410 420 

gl28 .pep LFAQ I KKLYG I GFAE KTVPVWHKDVRYFE LQQNGKT I GGVYMDL YAREGKRGGAWMND YK 

Nil I MM;MMMMM II 1 1 I I I I I 1 1 I I 1 1 1 1 I I I II 1 1 I I I I I I 

m 128 LFAQXKXLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 

40 50 60 70 80 90 

430 440 450 460 470 480 

gl 2 8 . pep GRRRFADGTLQLPTAYLVCNFAP PVGGKEARLSHDE I LTLFHETGHGLHHLIiTQVDELGV 

I M I i - 1 I 1 1 1 1 1 1 1 ! 1 1 1 ! I M I M I = M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 

GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 
100 110 120 130 140 150 



ml28 



490 500 510 520 530 540 

9128 . pep SGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

llllll M 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 M 1 1 IMIIII MINI II IIIMM III 

mi2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 

I 60 170 180 190 200 210 

550 560 570 580 590 600 

gl28 . pep LVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

Ml IIMIIIIIIIhll M M M MM M MM M M M M l M MMMMM 

XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 
220 230 240 250 260 270 



ml28 



610 620 630 640 650 660 

g!28 . pep SAG YYS YAWAEVLS TDAYAAFEE S DDVAATGKRFWQE I LAVGGSRS AAES FKAFRGRE P S 

I- MM MMMM MMMMMMMMMMM MM MMMMMMMM 

ml 2 8 SAAXYS YAWAEVLSADAYAAFEESDDVAATGKRFWQE I LAVGXSRSGAES FKAFRGRE PS 

280 290 300 310 320 330 
670 679
gl2 8 .pep IDALLRQSGFDNAAX 

IMIihIIIMI: 

ml 2 8 IDALLRHSGFDNAVX 

340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3089>:

a128.seq

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>:

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r m 1 1 1 1 1 1 1 1 1 1 1 1 

MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 



al28 



70 80 90 100 110 120 

ml2 8 . pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I M I I I I I I I I : I I I M | | : | | | | | | | | | | f | | | | | t | I [ |f | | | | | | | | | | || | 

ERVGRIWGWSHLNSVTDT.PELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 



al28 



130 

ml28.pep TLSPAQKTKLNH 

111 I I M II II 

a128 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



inl28.pep 
al28 

ml28 .pep 
al28 

ml28 . pep 
al28 

ml28.pep 
al28 

inl28.pep 
al28 

ml28 .pep 
al28 

ml28 .pep 
al28 



FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
19 ° 200 210 220 230 240 

TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 

140 150 
YASEKLREAKYAFSETXVKKYFPVGX 
M : I I If I I | | | | | | | || | | | J I | 
ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

160 170 180 190 200 210 

VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
MINIM I I II I I I I I I I I I | | | | M | | | I I M M I : I I I I I I I I I I M I I j j M I I 
VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
' I I I M I I I I I M I M I I I I M M | : 1 M | | : | | | | | | | | j | | | | | I I I I I I I I I I I M 

NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I M I I I I I I I I I I I I I I | I | | | | | | | | | | | | | I II I I M j I I I I I | | || | | | | | | j 
ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAPCNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

XGMFXVRQXEFALFDMMIYSEDDEGRLPCNWQQVLDSVRKKVAVIQPPEYNRFALSFGHI F 
1 1 I < < ! I I I ! I I I I I I II I I I I I I I I | I M I I I I I I : I I I :: | I M I I I I I I I I M 
RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFJ^JSFGHIF 
550 560 570 580 590 600 
400 410 420 430 440 450

ml 2 8.pep ^^^y f T^y^y '■'^''•'^'''GKRFWQE I LAVGXSRS GAE S FBCAFRG 

^ ZWL ^AAAiAA^^ ^ ii^ii^^i^ i iA^ic^ A^AA^ ^ i-iA^ci 

610 620 630 640 650 660 

460 470 
ml28 .pep REPSIDALLRHSGFDNAVX 

M I ( I M I I I 1 I i I I II : 
al28 REPSI DALLRH S G FDNAAX 

670 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3091>: 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

aatcgccgcc atcaaagccc aaacgcacac cggctgg^a 
aacactgtcg aacccctgac cggcatcacc gaacgcgtcg gcaggat?tg 

GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCr 

ccgtctataa cgaactgatg cccgaaatca ccgtcttcS c^SSlic 

GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA SSS^Jc 

cgaattcgac accctctccc ccgcacaaaa aaccaaactc aaccacgSc 

TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

gaactggcaa aactgcaaac cgaaggcgcg caactttccg ccaS™? 

CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

ccgcaccgct tgccggcatt cccgaagacg cgctcgccat gtttgccgc? 

GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 
ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 
AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGAcSc^C 
AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
AACCGCCAAA CTGCTCGGCT T C AAAAAC T A CGCCGAATTG TCGCTGGCAA 
CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTcJSgC 
CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACtJgG 

gctacgccag cgaaaaactg cgcgaagcca aatacgcgtt cagcgaaacc 
gaagtcaaaa aatacttccc cgtcggcaaa gtattaaacg gactgttcgc 

CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCgSScC 
A ™^ GGGCG tttatatgga TTTGTACGCA CGCGAAGGCA M S 
CGCGTGGATG AACGACTACA AAGGCCGCCG ccgtttttca gacggcacgc 
TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ScCG^GGC 

GGCAGGGAAG cccgcctgag ccacgacgaa atcctcatcc tcttSSK 

^n^ ACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 
CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAaStGC 
3^GCCAA ^TTCCAA CGCGGCATGT TCCTCGTCCG GCAAA^GAG 

ttcgccctct ttgatatgat GATTTACAGC gaagacgacg aaggccgtct 

GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCcSS 

TCCAGCCGCC cgaatacaac CGCTTCGCCT tgagcttcgg ccacatcttc 

GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
^^ GGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGcS 
CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC
ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA
This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1>:

m128-1.pep

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA
51 NTVEPLTGIT ERVGRIWGVV SHLNSVADTP ELRAVYNELM PEITVFFTEI
101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA
151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS

651 AAESFKAFRG REPSIDALLR HSGFDNAV*

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3093>: 

g128-1.seq (partial)

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG

1451 TGTCCGGCAT CAACGGCGTA AAA

This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-1.ng>:

g128-1.pep (partial)

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA

51 NTVERLTGIT ERVGRIWGVV SHLNSVVDTP ELRAVYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K

m128-1/g128-1 ORFs 128-1 and 128-1.ng showed a 94.5% identity in 491 aa overlap
m128-1 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ

490 500 510 520 530 540

The following DNA sequence was identified in N. meningitidis <SEQ ID 3095>:

a128-1.seq

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

1M * G ™^ CA ^ra^C ATCAAAGCCC AAACGCACAC CGSCTsSS 

11] ^^.l^ AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCG CC 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTcS 

IV; ^ C ,^ GACA TCGAGCTG ™ CAACCGCTTC AAAACCATCA 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA aSaSaGgS 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG 






501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-1.a>:

a128-1.pep

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS

651 AAESFKAFRG REPSIDALLR HSGFDNAA*



m128-1/a128-1 ORFs 128-1 and 128-1.a showed a 97.8% identity in 677 aa overlap
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>:

m206.seq 






1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA
251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA
351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC

401 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC
451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 
501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 

m206.pep

1 MFPPDKTLFL CLSALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIDRT

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 
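
The correspondence between a listed DNA sequence and its amino acid sequence can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; the function name `translate` and the choice to report any codon containing a non-ACGT symbol as `X` are illustrative assumptions, not part of the specification. Note that some ambiguous codons in the listing (for example `AAr`) still encode a single residue, which this simplified sketch does not resolve.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop an incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        # Codons containing ambiguity codes (r, s, y, n, ...) are reported as X.
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First row of m206.seq as listed above (50 bases = 16 codons plus 2 spare bases).
first_row = "ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT"
print(translate(first_row))  # MFPPDKTLFLCLSALL
```

The printed residues agree with the start of m206.pep (MFPPDKTLFL CLSALL...).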

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 

g206.seq

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggattcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa

351 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc
451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa
501 ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g206.pep

1 MFSPDKTLFL CLGALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIGRT

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 
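
The identity figures quoted for pairs of ORFs can be re-derived from the listed peptide sequences when the alignment contains no gaps. The sketch below is an illustration under stated assumptions: the helper name `percent_identity` is hypothetical, the two sequences are taken to be already aligned and of equal length, and an undetermined residue (X) is simply counted as a mismatch. It does not reproduce the alignment program used to generate the reported values.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns carrying the same residue in both sequences."""
    a = "".join(aligned_a.split())
    b = "".join(aligned_b.split())
    if not a or len(a) != len(b):
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    # A gap character never counts as a match, even if present in both rows.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


# ORF 206 (m206.pep) and ORF 206.ng (g206.pep) as listed above, stop codon omitted.
M206 = """
MFPPDKTLFL CLSALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIDRT
QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT
ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS
GKTIKTEKLS TPFYAKNYLG AHTFFTE
"""
G206 = """
MFSPDKTLFL CLGALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIGRT
QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT
ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS
GKTIKTEKLS TPFYAKNYLG AHTFFTE
"""

overlap = len("".join(M206.split()))
print(f"{percent_identity(M206, G206):.1f}% identity in {overlap} aa overlap")
```

For these two sequences, 170 of the 177 positions match, which corresponds to the 96.0% figure stated for ORF 206 and ORF 206.ng.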



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng)
from N. gonorrhoeae:

m206/g206

m206.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS
g206 MFSPDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS

m206.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD
g206 LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD

m206.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX
g206 IVFFNTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3101>:

a206.seq

1 ATGTTTCCCC CCGACAAAAC
51 CGCCTCATGC GGCACGACCT
101 AGACAGTCCG GCAAATCCAA
151 CAAGGCTCGC AGGAACTCAT
201 CTACAAATGG GGCGGCAGCA
251 TGATTCAATT CGTTTACAAA
301 GCCCGCGACA TGGCGGCGGC
351 GGCCGGCGAC CTCGTATTCT
401 ACGTCGGACT CTATATCGGC
451 GGCAAAACCA TCAAAACCGA
501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206.a>:

a206.pep

1 MFPPDKTLFL CLSALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIDRT

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE*

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 177 aa overlap
m206.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3103>:

m287.seq

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAPf 

ini ^^^ GGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

^ rtlr^^ TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGG CAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 
AGGGAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGgSg 

4 01 ccggggaatc gtctcagccg gcaaaccaac cggatatgg? SSJSSSS 

5m ™rr^ GC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA SjJcSS 
o AGGGGTGCC CAftGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 
551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC
601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC
651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA
701 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT
751 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT
801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA
851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT
901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA
951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC
1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC
1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA
1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG
1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG
1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA
1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG
1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT
1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG
1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA
1451 AAAAAGAGCA GGATTGA

This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287.pep

1 MFKRSVIAMA CIFALSACGG GGGGSPDVKS ADTLSKPAAP VVSEKETEAK

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV

451 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD*



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3105>:

g287.seq

1 atgtttaaac gcagtgtgat tgcaatggct tgtatttttc ccctttcagc
51 ctgtgggggc ggcggtggcg gatcgcccga tgtcaagtcg gcggacacgc
101 cgtcaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggtg
151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc
201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag
251 tttcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc
301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga
351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg
401 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg
451 acgaacgtgg gcaattctgt tgtgattgac ggaccgtcgc aaaatataac
501 gttgacccac tgtaaaggcg attcttgtaa tggtgataat ttattggatg
551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa
601 attaagcgat ataaaaaaga cgagcaacgg gagaattttg tcggtttggt
651 tgctgacagg gtaaaaaagg atggaactaa caaatatatc atcttctata
701 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc
751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg
801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag
851 ggaattaccg gtatctgact tacggggcgg aaaaattgcc cggcggatcg
901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa tgcttgttgg
951 cacggccgtg tacaacggcg aagtgctgca tttccatatg gaaaacggcc
1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga tttcggcagc
1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac
1101 gcaaaaattc aaagccgcca tcgatggaaa cggctttaag gggacttgga
1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc
1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg
1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga


This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287.ng>: 

g287.pep

1 MFKRSVIAMA CIFPLSACGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV

51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR

151 TNVGNSVVID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG

401 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD*



m287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 

10 20 30 40

m287.pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA

g287 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA

g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS

g287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR

360 370 380 390 400 410

470 480 489

m287.pep PTDAEKGGFGVFAGKKEQDX

a287.seq






1 ATGTTTAAAC GCAGTGTGAT
51 CTGTGGGGGC GGCGGTGGCG
101 TGTCAAAACC TGCCGCCCCT
151 CTGCCGAAAG AAAAGAAAGA
201 CGATACGCAG GACGCAACCG
251 TTTCGGCAGA AAATACAGGC
301 GAAAATAAAG ACGAGGGACC
351 TACAGATAGT TCGACACCGA
401 GAGATATGGG AAACCAAGCA
451 AACCAACCGG ATATGGCAAA
501 GTCGGCAGGG GAAAATGCCG
551 CTGAAAACAA TCAAGTCGGC
601 CCTAACGCCA CGAATGGCGG
651 TGGCATCAAG CTTGACAGCG
701 AAGACAAAGT ATGCGATAGA
751 TCAGAATTTG AAAAATTAAG
801 AGACGAGCAA CGAGAGAATT
851 AGAATGGAAC TAACAAATAT
901 TCTTCATCTG CGCGATTCAG
951 GGCCGAGATG CCGCTGATTC
1001 ATGGGGAAGC GGTCAGCCTG
1051 GAAGGGAATT ACCGGTATCT
1101 ATCGTATGCC CTCAGTGTGC
1151 CGGGCACGGC CGTGTACAAC
1201 GGCCGTCCGT CCCCGTCCGG
1251 CAGCAAATCT GTGGACGGCA
1301 GTACGCAAAA ATTCAAAGCC
1351 TGGACGGAAA ATGGCGGCGG
1401 CGGCGAAGAA GTGGCGGGAA
1451 AGGGCGGATT CGGCGTGTTT


This corresponds to the amino acid sequence <SEQ ID 3108; ORF 287.a>: 

a287.pep

1 MFKRSVIAMA CIVALSACGG GGGGSPDVKS ADTLSKPAAP VVTEDVGEEV
51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP
101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA
151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNFASSTN
201 PNATMGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK
251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS
301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP
351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN
401 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT
451 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD*



m287/a287 ORFs 287 and 287.a showed a 77.2% identity in 501 aa overlap

m287.pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA
a287 MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA
10 20 30 40 50 60
70 80 90 100 110
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m287 . pep ^ ^ ^^^"y ^^^^^^^^'v'^-^-^^^^^Q-^-TDAGE S S Q^ANQE»DMA.NAADGMQGDDPSAGGQNAC3NTA 

™™^^ 
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m287. P e P CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEORENFVGLVADRVEKNGTNKwkiiD 
^ u Zb0 260 270 280 290 



a287 



290 300 310 



320 330 340 



m287 . pep KP ySFARFRRSARSRRSLPAEMPLIPWQADTLIVDGEAVSLTGHSGNI 
m287.pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS
a287 GSKSVDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS
420 430 440 450 460 470

470 480 489
m287.pep YRPTDAEKGGFGVFAGKKEQDX
a287 YRPTDAEKGGFGVFAGKKEQDX
480 490


The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>:

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTAT?? TT^GCGCG 
G ^ AGACGT TGTTTCTCCT GCCAATGCCG ATACAgS ££5S£ 

601 atcgacgtat tcggaacgat acgcaacaga accgaaatgc acctatacaa 

651 tgccgaaaca ctgaaagccc aaacaaaact ggaatatttc gcagtagaca

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 3110; ORF 406>:

m406.pep

1 MQARLLIPIL FSVFILSACG TLTGIPSHGG GKRFAVEQEL VAASARAAVK

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDVVSP ANADTDVFIN

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE VVRQHRQGQP *



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3111>:

g406 . seq 

1 ATGCGGGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA

251 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC

301 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG
351 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 
401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 
451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 
551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 
651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 
751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 
801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 
851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 
901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 
951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 3112; ORF 406.ng>:

g406.pep

1 MRARLLIPIL FSVFILSACG TLTGIPSHGG GKRFAVEQEL VAASARAAVK

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDVVSP ANADTDVFIN

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF 406.ng) from
N. gonorrhoeae:

g406/m406 

                    10        20        30        40        50        60
g406.pep    MRARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR
            |:||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m406        MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR
                    10        20        30        40        50        60

                    70        80        90       100       110       120
g406.pep    KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m406        KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG
                    70        80        90       100       110       120

                   130       140       150       160       170       180
g406.pep    LTTSLSTLNAPALSRTQSDGSGSRSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF
            |||||||||||||||||||||||:||||||||||||||||||||||||||||||||||||
m406        LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF
                   130       140       150       160       170       180

                   190       200       210       220       230       240
g406.pep    FLRGIDVVSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m406        FLRGIDVVSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL
                   190       200       210       220       230       240

                   250       260       270       280       290       300
g406.pep    IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHTGNSAPSVEADN
            ||||||||||||||||||||||||||||||||||||||||||:|||||||||||||||||
m406        IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN
                   250       260       270       280       290       300

                   310       320
g406.pep    SHEGYGYSDEAVRQHRQGQPX
            ||||||||||:||||||||||
m406        SHEGYGYSDEVVRQHRQGQPX
                   310       320



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3113>:

a406.seq

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 
" ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

^^GTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

\l\ ll^l^ GATTCGTGGG GAATACATAA ACAGCCCTGC CGTCCGTACc" 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

H\ r™n°Z GGT TTAACCA ^ CT TT AT CT AC ACTTAA^GcS C^aSS 

4 01 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 
*™ G '£ GGA TGGGG GATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

11, ^ A ™^ C TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT* 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAP 

851 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCgSgA XSSSJ£ 

III SSSSSS ^ ATGGATA CAGCGATGAA GCAGTGCGAC GaSSg^A 

This corresponds to the amino acid sequence <SEQ ID 3114; ORF 406.a>:

a406.pep

1 MQARLLIPIL FSVFILSACG TLTGIPSHGG GKRFAVEQEL VAASARAAVK

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDVVSP ANADTDVFIN

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN

301 SHEGYGYSDE AVRRHRQGQP *

m406/a406 ORFs 406 and 406.a showed a 98.8% identity in 320 aa overlap

                    10        20        30        40        50        60
m406.pep    MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a406        MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m406.pep    KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a406        KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m406.pep    LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a406        LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m406.pep    FLRGIDVVSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a406        FLRGIDVVSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m406.pep    IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN
            ||||||||||||||||||||||||||||||||||||||||||:||||:||||||||||||
a406        IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN
                   250       260       270       280       290       300

                   310       320
m406.pep    SHEGYGYSDEVVRQHRQGQPX
            ||||||||||:||:|||||||
a406        SHEGYGYSDEAVRRHRQGQPX
                   310       320



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919.
The predicted gene 919 was cloned in pET vector and expressed in E. coli. The product of
protein expression and purification was analyzed by SDS-PAGE. Panel A shows the
analysis of 919-His fusion protein purification. Mice were immunized with the purified
919-His and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal
assay (panel D), and ELISA assay (panel E). Symbols: M1, molecular weight marker; PP,
purified protein; TP, N. meningitidis total protein extract; OMV, N. meningitidis outer
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein
product (A) and the N. meningitidis immunoreactive band (B). These experiments confirm
that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity
plots, antigenic index, and amphipathic regions of ORF 919 are provided in Figure 10. The
AMPHI program is used to predict putative T-cell epitopes (Gao et al. 1989, J. Immunol
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992,
Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 919 and the amino acid
sequence encoded thereby are provided in Example 1.
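
A hydrophilicity plot of the kind referred to above is, in general, a sliding-window average of per-residue scores along the protein. The sketch below illustrates only that general idea. It assumes the Kyte-Doolittle hydropathy scale (positive values are hydrophobic, so hydrophilic stretches appear as negative values) and a 7-residue window; the text does not state which scale or window was used for the Figures, and the sketch does not reproduce the AMPHI program or the antigenic index. The function name `hydropathy_profile` is hypothetical, and the example sequence is the N-terminus of ORF 406 as listed in Example 1.

```python
# Kyte-Doolittle hydropathy values (assumed scale; positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 7) -> list:
    """Mean hydropathy of each window of consecutive residues."""
    seq = "".join(protein.split()).upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    # Undetermined residues (e.g. X) are scored as 0.0 in this sketch.
    scores = [KYTE_DOOLITTLE.get(aa, 0.0) for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


# First 20 residues of m406.pep (Example 1).
n_terminus = "MQARLLIPIL FSVFILSACG"
for start, value in enumerate(hydropathy_profile(n_terminus), start=1):
    print(f"residues {start}-{start + 6}: {value:+.2f}")
```

Plotting these window averages against the window position gives a profile in which sustained negative values indicate hydrophilic stretches under this scale.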

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279.
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of
protein expression and purification was analyzed by SDS-PAGE. Panel A shows the
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay
(panel D), and ELISA assay (panel E). Symbols: M1, molecular weight marker; TP, N.
meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle preparation.
Arrows indicate the position of the main recombinant protein product (A) and the N.
meningitidis immunoreactive band (B). These experiments confirm that 279 is a
surface-exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic
index, and amphipathic regions of ORF 279 are provided in Figure 11. The AMPHI program is
used to predict putative T-cell epitopes (Gao et al. 1989, J. Immunol 143:3007; Roberts et al.
1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9).
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby are
provided in Example 1.



EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576.
The predicted gene 576 was cloned in pGEX vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 576-GST
fusion protein purification. Mice were immunized with the purified 576-GST and sera
were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D),
and ELISA assay (panel E). Symbols: M1, molecular weight marker; TP, N. meningitidis
total protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows
indicate the position of the main recombinant protein product (A) and the N. meningitidis
immunoreactive band (B). These experiments confirm that ORF 576 is a surface-exposed
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and
amphipathic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to
predict putative T-cell epitopes (Gao et al. 1989, J. Immunol. 143:3007; Roberts et al. 1996,
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9).
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby are
provided in Example 1.

EXAMPLE 5 
Expression of ORF 519 and 519-1
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519.
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 519-His
fusion protein purification. Mice were immunized with the purified 519-His and sera were
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and
ELISA assay (panel E). Symbols: M1, molecular weight marker; TP, N. meningitidis total
protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows indicate
the position of the main recombinant protein product (A) and the N. meningitidis
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein
and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and amphipathic
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict
putative T-cell epitopes (Gao et al. 1989, J. Immunol. 143:3007; Roberts et al. 1996, AIDS Res
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9). The nucleic
acid sequence of ORF 519 and the amino acid sequence encoded thereby are provided in
Example 1.

EXAMPLE 6
Expression of ORF 121 and 121-1
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121.
The predicted gene 121 was cloned in pET vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 121-His
fusion protein purification. Mice were immunized with the purified 121-His and sera were
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols:
M1, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N.
meningitidis outer membrane vesicle preparation. Arrows indicate the position of the main
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen.
The hydrophilicity plots, antigenic index, and amphipathic regions of ORF 121 are provided in
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al. 1989, J.
Immunol. 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al.
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 121 and the amino
acid sequence encoded thereby are provided in Example 1.



EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128.
The predicted gene 128 was cloned in pET vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 128-His
purification. Mice were immunized with the purified 128-His and sera were used for
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D), and
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: M1,
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant
protein product (A) and the N. meningitidis immunoreactive band (B). These experiments
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The
hydrophilicity plots, antigenic index, and amphipathic regions of ORF 128 are provided in
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al. 1989, J.
Immunol. 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al.
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 128 and the amino
acid sequence encoded thereby are provided in Example 1.

EXAMPLE 8 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206.
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 206-His
purification. Mice were immunized with the purified 206-His and sera were used for
Western blot analysis (panel B). It is worth noting that the immunoreactive band in protein
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on
the nature of this antibody staining we expressed ORF 206 in E. coli without the His-tag and
including the predicted leader peptide. Western blot analysis on total protein extracts from E.
coli expressing this native form of the 206 protein showed a reactive band at a position of 38
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B is specific
and that anti-206 antibodies likely recognize a multimeric protein complex. Panel C shows
the FACS analysis, panel D the bactericidal assay, and panel E the ELISA assay.
Results show that 206 is a surface-exposed protein. Symbols: M1, molecular weight marker;
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle
preparation. Arrows indicate the position of the main recombinant protein product (A) and the
N. meningitidis immunoreactive band (B). These experiments confirm that 206 is a
surface-exposed protein and that it is a useful immunogen. The hydrophilicity plots,
antigenic index, and amphipathic regions of ORF 206 are provided in Figure 16. The AMPHI
program is used to predict putative T-cell epitopes (Gao et al. 1989, J. Immunol. 143:3007;
Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J
Immunol Suppl 11:9). The nucleic acid sequence of ORF 206 and the amino acid sequence
encoded thereby are provided in Example 1.

EXAMPLE 9
Expression of ORF 287
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287.
The predicted gene 287 was cloned in pGEX vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 287-GST
fusion protein purification. Mice were immunized with the purified 287-GST and sera
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel
D). Results show that 287 is a surface-exposed protein. Symbols: M1, molecular weight
marker. Arrow indicates the position of the main recombinant protein product (A). These
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen.
The hydrophilicity plots, antigenic index, and amphipathic regions of ORF 287 are provided in
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et al. 1989, J.
Immunol. 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al.
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 287 and the amino
acid sequence encoded thereby are provided in Example 1.



EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406.
The predicted gene 406 was cloned in pET vector and expressed in E. coli. The product of
protein purification was analyzed by SDS-PAGE. Panel A shows the analysis of 406-His
fusion protein purification. Mice were immunized with the purified 406-His and sera were
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols:
M1, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N.
meningitidis outer membrane vesicle preparation. Arrows indicate the position of the main
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen.
The hydrophilicity plots, antigenic index, and amphipathic regions of ORF 406 are provided in
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al. 1989, J.
Immunol. 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al.
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 406 and the amino
acid sequence encoded thereby are provided in Example 1.

EXAMPLE 11



Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 
Table 2

225 gene variability: List of used Neisseria strains

Identification   Strain              Source / reference
number

Group B
zo01_225         NG6/88              R. Moxon / Seiler et al., 1996
zo02_225         BZ198               R. Moxon / Seiler et al., 1996
zo03_225         [illegible]         R. Moxon / Seiler et al., 1996
zo04_225         297-0               R. Moxon / Seiler et al., 1996
zo05_225         1000                R. Moxon / Seiler et al., 1996
zo06_225         BZ147               R. Moxon / Seiler et al., 1996
zo07_225         BZ169               R. Moxon / Seiler et al., 1996
zo08_225         528                 R. Moxon / Seiler et al., 1996
zo09_225         NGP165              R. Moxon / Seiler et al., 1996
zo10_225         BZ133               R. Moxon / Seiler et al., 1996
zo11_225         NGE31               R. Moxon / Seiler et al., 1996
zo12_225         [illegible]         R. Moxon / Seiler et al., 1996
zo13_225         NGE28               R. Moxon / Seiler et al., 1996
zo14_225         NGH38               R. Moxon / Seiler et al., 1996
zo15_225         SWZ107              R. Moxon / Seiler et al., 1996
zo16_225         NGH15               R. Moxon / Seiler et al., 1996
zo17_225         NGH36               R. Moxon / Seiler et al., 1996
zo18_225         BZ232               R. Moxon / Seiler et al., 1996
zo19_225         BZ83                R. Moxon / Seiler et al., 1996
zo20_225         44/76               R. Moxon / Seiler et al., 1996
zo21_225         MC58                R. Moxon
zo96_225         2996                Our collection

Group A
zo22_225         205900              R. Moxon
zo23_225         F6124               R. Moxon
Z2491            Z2491               R. Moxon / Maiden et al., 1998

Group C
zo24_225         90/18311            R. Moxon
zo25_225         93/4286             R. Moxon

Others
zo26_225         A22 (group W)       R. Moxon / Maiden et al., 1998
zo27_225         E26 (group X)       R. Moxon / Maiden et al., 1998
zo28_225         860800 (group Y)    R. Moxon / Maiden et al., 1998
zo29_225         E32 (group Z)       R. Moxon / Maiden et al., 1998

Gonococcus
zo32_225         Ng F62              R. Moxon / Maiden et al., 1998
zo33_225         Ng SN4              R. Moxon
fa1090           FA1090              R. Moxon

References:

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856.
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145.



The amino acid sequences for each listed strain are as follows: 

FA1090 <SEQ ID 3115>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

nadeligsamglneqpvlpvnrvparragnadelignamglneqpvlpvnrvpaISa 

DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGTDCSGF 

IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

nadeligsamglneqpvlpwrvparragnadelignamglneqpvlpvnrapa^r^Sa 
delignamgllgiayryggtsistgfdcsgtoqhifkpj^ginlprtsaeqarmgtpvaJ 

™P^L^ TLGGSRISHVGLYIGNNRFI ^ PRTGraiEITSLSHK ™ SG ^ A 
ZO02_225 <SEQ ID 3118>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
nadeligsamglneqpvlpwrvparragnadelignamglneqpJlpSS^Sa 
delignamgllgiayryggtsvstgfdcsgfmqhifki^ginlprtsaeoSrmg^aS 

SDOT™ 1KSHlmLnOTRFmpRTOTIEITSLSmMS ™ 
ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
nadeligsamglneqpvlpvnrvparragnadelignamglneqpvlpwpapar^Sa 

ZO04_225 <SEQ ID 3120>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN*

ZO05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLFVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO11_225 <SEQ ID 3127>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN*

ZO12_225 <SEQ ID 3128>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO13_225 <SEQ ID 3129>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 

VK^DPSRFLN* TLGGSRISHVGLYIG 

ZO14_225 <SEQ ID 3130>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO15_225 <SEQ ID 3131>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO16_225 <SEQ ID 3132>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

ZO17_225 <SEQ ID 3133>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO18_225 <SEQ ID 3134>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

V^DPS^L^ TLGGS ^ 

ZO19_225 <SEQ ID 3135>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
VK^DPSRFL™ TLGGSRI SHVGLYIGNNRFI ^ PRTGraiEITSLSHKYWS GKYAFARR 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN*

ZO21_225 <SEQ ID 3137>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN*

ZO22_225 <SEQ ID 3138>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

nadeligsamglneqpvlpvnrvparragnadelignamgi^eqpvlpvnrapa^Sa 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR
VKKNDPSRFLN* 

ZO23_225 <SEQ ID 3139>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO24_225 <SEQ ID 3140>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO25_225 <SEQ ID 3141>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO26_225 <SEQ ID 3142>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN*

ZO27_225 <SEQ ID 3143>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO28_225 <SEQ ID 3144>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO29_225 <SEQ ID 3145>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO32_225 <SEQ ID 3146>

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN*

ZO33_225 <SEQ ID 3147>

MDSFFKPAVWAVLWLMFAVRSALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

ZO96_225 <SEQ ID 3148>

MDSFFKPAVWAVLWLMFAVRPAIADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 
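
The two shading levels described for the alignment figure can be reproduced column by
column once the sequences have been aligned. The following is a minimal sketch only: it
assumes the input sequences are already aligned to equal length with "-" for gaps, and the
residue groupings in SIMILAR_GROUPS, like the function names, are illustrative assumptions
rather than the criteria actually used to draw the figure.

```python
# ASSUMPTION: an illustrative grouping of residues with similar side-chain
# character; the text does not specify which grouping underlies the figure.
SIMILAR_GROUPS = [
    set("AVLIM"),   # aliphatic / hydrophobic
    set("FYW"),     # aromatic
    set("KRH"),     # basic
    set("DE"),      # acidic
    set("STNQ"),    # polar, uncharged
]


def classify_column(residues):
    """Classify one alignment column as 'identical', 'similar' or 'variable'."""
    distinct = set(residues)
    if "-" in distinct:
        return "variable"          # any gap breaks conservation
    if len(distinct) == 1:
        return "identical"         # would be shaded dark
    if any(distinct <= group for group in SIMILAR_GROUPS):
        return "similar"           # would be shaded gray
    return "variable"


def classify_alignment(aligned_sequences):
    """Classify every column of a set of pre-aligned, equal-length sequences."""
    if len({len(seq) for seq in aligned_sequences}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    return [classify_column(column) for column in zip(*aligned_sequences)]


if __name__ == "__main__":
    # Three short fragments of the repeat region, used purely as a demonstration.
    fragments = ["PVLPVNRAP", "PVLPINRAP", "PVLPVNRVP"]
    for position, label in enumerate(classify_alignment(fragments), start=1):
        print(position, label)
```

Because the strains listed above differ in length, a real comparison would first require
a multiple alignment step, which this sketch deliberately leaves out.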



EXAMPLE 12 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 
Table 3

235 gene variability: List of used Neisseria strains

Identification   Strain              Reference
number

Group B
gnmzq01          NG6/88              Seiler et al., 1996
gnmzq02          BZ198               Seiler et al., 1996
gnmzq03          NG3/88              Seiler et al., 1996
gnmzq04          1000                Seiler et al., 1996
gnmzq05          1000                Seiler et al., 1996
gnmzq07          BZ169               Seiler et al., 1996
gnmzq08          528                 Seiler et al., 1996
gnmzq09          NGP165              Seiler et al., 1996
gnmzq10          BZ133               Seiler et al., 1996
gnmzq11          NGE31               Seiler et al., 1996
gnmzq13          NGE28               Seiler et al., 1996
gnmzq14          NGH38               Seiler et al., 1996
gnmzq15          SWZ107              Seiler et al., 1996
gnmzq16          NGH15               Seiler et al., 1996
gnmzq17          NGH36               Seiler et al., 1996
gnmzq18          BZ232               Seiler et al., 1996
gnmzq19          BZ83                Seiler et al., 1996
gnmzq21          MC58                Virji et al., 1992

Group A
gnmzq22          205900              Our collection
gnmzq23          F6124               Our collection
z2491            Z2491               Maiden et al., 1998

Group C
gnmzq24          90/18311            Our collection
gnmzq25          93/4286             Our collection

Others
gnmzq26          A22 (group W)       Maiden et al., 1998
gnmzq27          E26 (group X)       Maiden et al., 1998
gnmzq28          860800 (group Y)    Maiden et al., 1998
gnmzq29          E32 (group Z)       Maiden et al., 1998
gnmzq31          N. lactamica        Our collection

Gonococcus
gnmzq32          Ng F62              Maiden et al., 1998
gnmzq33          Ng SN4              Our collection
fa1090           FA1090              Dempsey et al., 1991

References:

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856.
Maiden R. et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145.
Virji M. et al., Mol. Microbiol., 1992, 6:1271-1279.
Dempsey J.F. et al., J. Bacteriol., 1991, 173:5476-5486.



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ01 <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ04 <SEQ ID 3153> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 

AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ05 <SEQ ID 3154> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 

AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 

AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ09 <SEQ ID 3157> 

MKPLILGLAAALVLSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 

AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQIFGNDAVLYITITEYGTS 

YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST

AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ11 <SEQ ID 3159> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 

AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ14 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ15 <SEQ ID 3162> 

^f'^ GLAAVLALSACQV Q KAPDFDYT SFKESKPASILVVPPLNESPDVNGTWGVLAST 

AAPLSEAGYWFPAA^ETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAV^Q^ 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ16 <SEQ ID 3163> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESFDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ26 <SEQ ID 3171> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ28 <SEQ ID 3173>

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ29 <SEQ ID 3174>

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK*

GNMZQ31 <SEQ ID 3175> 

MKPLILGLAAVLALSACQVQKAPDFDYTAFKESKPASILWPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGPCVLWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKAAAYDLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ32 <SEQ ID 3176>

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ33 <SEQ ID 3177> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

Z2491 <SEQ ID 3178> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 235, further confirming its utility as an antigen for both 
vaccines and diagnostics. 
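
The shading scheme described for the alignment figure can be approximated column by column. The following is a minimal sketch, not the procedure used to produce the figure: the residue classes in SIMILARITY_GROUPS are an assumption (the specification does not state which grouping underlies the gray shading), and the three inputs are only the C-terminal 35 residues of GNMZQ28, GNMZQ31 and GNMZQ32 as printed in the listing above, which happen to have equal length and so need no gaps.

```python
from collections import Counter

# Assumed residue classes. The specification does not say which
# grouping was used for the "similar characteristics" shading.
SIMILARITY_GROUPS = ["AVLIMC", "FWY", "KRH", "DE", "STNQ", "G", "P"]
GROUP_OF = {aa: i for i, group in enumerate(SIMILARITY_GROUPS) for aa in group}


def classify_columns(aligned):
    """Label each alignment column 'identical', 'similar' or 'variable'."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must already be aligned to one length")
    labels = []
    for column in zip(*aligned):
        residues = set(column)
        if len(residues) == 1:
            labels.append("identical")
            continue
        # Unknown symbols (for example a gap) fall back to themselves,
        # so they never count as similar to a different symbol.
        groups = {GROUP_OF.get(aa, aa) for aa in residues}
        labels.append("similar" if len(groups) == 1 else "variable")
    return labels


# C-terminal fragments copied from the listing above (terminal '*' dropped).
FRAGMENTS_235 = {
    "GNMZQ28": "DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK",
    "GNMZQ31": "DRGYQVSKAAAYDLLSPYSHNGILKGPRFVEEQPK",
    "GNMZQ32": "DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK",
}

labels = classify_columns(list(FRAGMENTS_235.values()))
counts = Counter(labels)
print(counts)
```

Under the assumed grouping, the H/R difference in the fragment is reported as "similar" (both basic), while T/A and N/D are reported as "variable". A different grouping would change only those labels, not the count of identical columns.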



EXAMPLE 13

Table 4 lists several Neisseria strains which were used to assess the conservation of
the sequence of ORF 287 among different strains.

Table 4

287 gene variability: List of Neisseria strains used

Identification    Strains    Reference
number

Group B
287_2             BZ198      Seiler et al., 1996
287_9             NGP165     Seiler et al., 1996
287_14            NGH38      Seiler et al., 1996
287_21            MC58       Virji et al., 1992

Group A
z2491             Z2491      Maiden et al., 1998

Gonococcus
fa1090            FA1090     Dempsey et al., 1991

References:

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856.
Maiden R. et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145.
Virji M. et al., Mol. Microbiol., 1992, 6:1271-1279.
Dempsey J.F. et al., J. Bacteriol., 1991, 173:5476-5486.


The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAFVVSEKETEAKEDAPQAGSQG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG
GFGVFAGKKEQD* 

287_21 <SEQ ID 3181>

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPONAADTDS 
STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADOA 
ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKDKSAS 
SSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDFGSKS 

SSggSaS 
FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 
QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 
LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTDKPPTR

SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGS 
YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGSKSVDGIIDSG 

VFAG^DRD*^ IDGNGreGTWTENGGGDVSGRFYGPAGEEVAGKY 
Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV
FAGKKEQD*



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 
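
The degree of conservation between strains can also be expressed as a pairwise percent identity. The sketch below is illustrative only and is not the method used for Figure 21: it compares the first 40 residues of 287_2, 287_9 and FA1090 as printed above, on the assumption that these N-terminal fragments align position by position without gaps. Full-length comparison of ORF 287 would need a proper alignment step because the strains differ in length.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent of positions with the same residue in two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("fragments must have the same length (no gap handling)")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# First 40 residues of each sequence, copied from the listing above.
FRAGMENTS_287 = {
    "287_2": "MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAP",
    "287_9": "MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAP",
    "FA1090": "MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAP",
}

# Identity for every unordered pair of strains.
identity_table = {
    (m, n): percent_identity(FRAGMENTS_287[m], FRAGMENTS_287[n])
    for m, n in combinations(FRAGMENTS_287, 2)
}

for (m, n), value in identity_table.items():
    print(f"{m} vs {n}: {value:.1f}%")
```

Because only a short leader fragment is compared, the figures say nothing about the more variable central region of the protein; they simply show how the calculation is set up.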

EXAMPLE 14 

Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5

519 gene variability: List of Neisseria strains used

Identification    Strains             Source / reference
number

Group B
zv01_519          NG6/88              R. Moxon / Seiler et al., 1996
zv02_519          BZ198               R. Moxon / Seiler et al., 1996
zv03_519ass       NG3/88              R. Moxon / Seiler et al., 1996
zv04_519          297-0               R. Moxon / Seiler et al., 1996
zv05_519          1000                R. Moxon / Seiler et al., 1996
zv06_519ass       BZ147               R. Moxon / Seiler et al., 1996
zv07_519          BZ169               R. Moxon / Seiler et al., 1996
zv11_519          NGE31               R. Moxon / Seiler et al., 1996
zv12_519          NGF26               R. Moxon / Seiler et al., 1996
zv18_519          BZ232               R. Moxon / Seiler et al., 1996
zv19_519          BZ83                R. Moxon / Seiler et al., 1996
zv20_519ass       44/76               R. Moxon / Seiler et al., 1996
zv21_519ass       MC58                R. Moxon
zv96_519          2996                Our collection

Group A
zv22_519ass       205900              R. Moxon
z2491_519         Z2491               R. Moxon / Maiden et al., 1998

Others
zv26_519          A22 (group W)       R. Moxon / Maiden et al., 1998
zv27_519          E26 (group X)       R. Moxon / Maiden et al., 1998
zv28_519          860800 (group Y)    R. Moxon / Maiden et al., 1998
zv29_519ass       E32 (group Z)       R. Moxon / Maiden et al., 1998

Gonococcus
zv32_519          Ng F62              R. Moxon / Maiden et al., 1998
fa1090_519        FA1090              R. Moxon

References:

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856.
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145.



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE
KRARIAESEGRKIEQINLASGQREMIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

Z2491_519 <SEQ ID 3186>

MEFFIILLAAVWFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

lISSsSK^ LQTQGGADAWLKI ^ QW ^ raN ^ ESNTL1MP ^ VADIGSL 



ZV01_519 <SEQ ID 3187> 

MEFFIILLVAVAVFGFKSFVVIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 



ZV02_519 <SEQ ID 3188> 

MEFFIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

ISAGMKI^SSKTA^ LQTQGGADAWLKIAEQWM ™ N ^ E ^ 
ZV03_519 <SEQ ID 3189> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

ISAGMKIIDSSKT A ^ LQTQGGADAWLKIAEQYVM ^^ ESNT 
ZV04_519 <SEQ ID 3190> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

isagmk^ssktai^ QTQ ^ 

ZV05_519 <SEQ ID 3191> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

IS^J^^sKTM^^ 
ZV06_519ASS <SEQ ID 3192> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVFSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERK

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
ISAGMKIIDSSKTA^ QTQGGADA ^ LKIAEQWAA ^^ ES 

ZV07_519 <SEQ ID 3193> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

rmeldktfeerdeinstwaaldeaagawgvkvlryeikdlvppqeilrsmqaqitaeS 

KRARIAESEGRKIEQINLASGQREAEIOOSEGEAnnavMn T ^"ZZZZtt^Z 




ISAGMKIIDSSKTAK

ZV11_519 <SEQ ID 3194>

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV12_519 <SEQ ID 3195> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV18_519 <SEQ ID 3196> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV19_519 <SEQ ID 3197> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV20_519ASS <SEQ ID 3198> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM
ISAGMKIIDSSKTAK*

ZV21_519ASS <SEQ ID 3199> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV22_519ASS <SEQ ID 3200> 

MEFFIILLAAVWFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV26_519 <SEQ ID 3201> 

MEFFIILLAAVWFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSSKTAK*

ZV27_519 <SEQ ID 3202>

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSI
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

ZV28_519 <SEQ ID 3203> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHST 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

ZV29_519ASS <SEQ ID 3204> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSIWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAOITAERE 

KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL
ISAGMKIIDSNKTAK*

ZV32_519 <SEQ ID 3205> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
KRARIAESEGRKIEQINIASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

isagmkiidssktaI^^ 

ZV96_519 <SEQ ID 3206> 

MEFFIILIAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSI

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark
shading indicates regions of homology, and gray shading indicates the conservation of amino
acids with similar characteristics. As is readily discernible, there is significant conservation
among the various strains of ORF 519, further confirming its utility as an antigen for both
vaccines and diagnostics.
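
Before sequences such as those listed above can be aligned, the listing has to be read into clean strings. The following is a minimal sketch of one way to do that, assuming the layout used in this specification: a header line of the form NAME <SEQ ID n>, followed by residue lines that may contain stray spaces and end with an asterisk. The function name parse_listing and the record layout are hypothetical, and the sample reproduces only the final line of two ORF 519 records for brevity.

```python
import re

# Header lines look like "ZV11_519 <SEQ ID 3194>".
HEADER = re.compile(r"^(\S+)\s+<SEQ ID (\d+)>\s*$")


def parse_listing(text):
    """Return {name: {"seq_id": int, "residues": str}} from a plain-text listing."""
    records = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank lines carry no information
        match = HEADER.match(line)
        if match:
            current = match.group(1)
            records[current] = {"seq_id": int(match.group(2)), "residues": ""}
        elif current is not None:
            # Remove every space inside the residue line before appending.
            records[current]["residues"] += "".join(line.split())
    for record in records.values():
        # The trailing '*' marks the end of the sequence, not a residue.
        record["residues"] = record["residues"].rstrip("*")
    return records


SAMPLE = """\
ZV11_519 <SEQ ID 3194>

I SAGMKI I DS SKTAK *

ZV12_519 <SEQ ID 3195>

I SAGMKI IDS SKTAK*
"""

records = parse_listing(SAMPLE)
for name, record in records.items():
    print(name, record["seq_id"], record["residues"])
```

The sketch only normalizes whitespace. It does not attempt to repair substituted characters, so any line that contains symbols outside the amino acid alphabet should be checked against the original document by hand.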



EXAMPLE 15 

Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6

919 gene variability: List of Neisseria strains used

Identification    Strains             Source / reference
number

Group B
zm01              NG6/88              R. Moxon / Seiler et al., 1996
zm02              BZ198               R. Moxon / Seiler et al., 1996
zm03              NG3/88              R. Moxon / Seiler et al., 1996
zm04              297-0               R. Moxon / Seiler et al., 1996
zm05              1000                R. Moxon / Seiler et al., 1996
zm06              BZ147               R. Moxon / Seiler et al., 1996
zm07              BZ169               R. Moxon / Seiler et al., 1996
zm08n             528                 R. Moxon / Seiler et al., 1996
zm09              NGP165              R. Moxon / Seiler et al., 1996
zm10              BZ133               R. Moxon / Seiler et al., 1996
zm11asbc          NGE31               R. Moxon / Seiler et al., 1996
zm12              NGF26               R. Moxon / Seiler et al., 1996
zm13              NGE28               R. Moxon / Seiler et al., 1996
zm14              NGH38               R. Moxon / Seiler et al., 1996
zm15              SWZ107              R. Moxon / Seiler et al., 1996
zm16              NGH15               R. Moxon / Seiler et al., 1996
zm17              NGH36               R. Moxon / Seiler et al., 1996
zm18              BZ232               R. Moxon / Seiler et al., 1996
zm19              BZ83                R. Moxon / Seiler et al., 1996
zm20              44/76               R. Moxon / Seiler et al., 1996
zm21              MC58                R. Moxon
zm96              2996                Our collection

Group A
zm22              205900              R. Moxon
zm23asbc          F6124               R. Moxon
z2491             Z2491               R. Moxon / Maiden et al., 1998

Group C
zm24              90/18311            R. Moxon
zm25              93/4286             R. Moxon

Others
zm26              A22 (group W)       R. Moxon / Maiden et al., 1998
zm27bc            E26 (group X)       R. Moxon / Maiden et al., 1998
zm28              860800 (group Y)    R. Moxon / Maiden et al., 1998
zm29asbc          E32 (group Z)       R. Moxon / Maiden et al., 1998
zm31asbc          N. lactamica        R. Moxon

Gonococcus
zm32asbc          Ng F62              R. Moxon / Maiden et al., 1998
zm33asbc          Ng SN4              R. Moxon
fa1090            FA1090              R. Moxon

References:

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856.
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145.



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTFSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

Z2491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM01 <SEQ ID 3209> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM02 <SEQ ID 3210>

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLFNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM04 <SEQ ID 3212>

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM05 <SEQ ID 3213>

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRIAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM08N <SEQ ID 3216> 

MKICYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM09 <SEQ ID 3217>

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218>

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM11ASBC <SEQ ID 3219> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGPCAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM13 <SEQ ID 3221> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM14 <SEQ ID 3222> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHSFQAKQFFER

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM16 <SEQ ID 3224> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPGRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM17 <SEQ ID 3225> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM18 <SEQ ID 3226> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA

VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP*

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFOAKOFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

vdrhyitlgaplfvatahpvtrkalnrlimaqdtgsaikgavrvdyfwgygdeageSgk 

QKTTGYVWQLLPNGMKPEYRP*

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFOAKOFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAODTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM26 <SEQ ID 3234>

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVOAKOFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQMPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP*

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGISAAILAACQSKSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP*

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

yftpwqvagngsiagtvtgyyepvucgddrrtaqarfpiygipddfisvplpSsgS 
lvrirqtgknsgtidnt^thtadlsqfpitarttaikgrfegsrflpyhtrnqSgga? 
dgkapilgyaedpvelffmhiqgsgpxktpsgkyirigyadknehpyvsigSSkg^ 
klgqtsmqgikaymrqnpqrlaevlgqnpsyiffreiagssndgpvgalgtplmgeyaga 

SgyvwIlS 
ZM29ASBC <SEQ ID 3237>

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER
yftpwqvagngsiagtvtgyyepvijcgddrrtaqarfpiygipddfisvplpaS 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL

klgqtsmqgiksymrqnpqrlaevlgqnpsyiffreltgsgndgpvgalgtpSSeSS 

ZM31ASBC <SEQ ID 3238>

MKKHLFRAALYGIAAAILAACQSKSIQTFPQPDTSIIKGPDRPAGIPDPAGTTVGGGGAV

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYVFFRELAGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 3240>

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPIHSFQAKRFFER
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark
shading indicates regions of homology, and gray shading indicates the conservation of amino
acids with similar characteristics. As is readily discernible, ORF 919 is significantly
conserved among the various strains, further confirming its utility as an antigen for both
vaccines and diagnostics.
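
By way of illustration only, the degree of conservation described above can be estimated directly from aligned sequence fragments. The following Python sketch is not the method used to prepare Figure 23; the function names `percent_identity` and `conservation_line` and the similarity classes in `SIMILAR_GROUPS` are assumptions introduced here, since the grouping behind the gray shading is not stated. It marks identical columns with `*` and columns whose residues fall in one assumed similarity class with `:`.

```python
from typing import Sequence

# Assumed similarity classes, for illustration only; the grouping actually
# used for the gray shading in Figure 23 is not stated in the text.
SIMILAR_GROUPS = ("AVLIM", "FWY", "STNQ", "KRH", "DE")


def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions at which two sequences are identical."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal aligned length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


def conservation_line(seqs: Sequence[str]) -> str:
    """Mark each column: '*' identical, ':' same assumed class, ' ' otherwise."""
    if not seqs or len({len(s) for s in seqs}) != 1:
        raise ValueError("need at least one sequence, all of equal aligned length")
    marks = []
    for column in zip(*seqs):
        residues = set(column)
        if len(residues) == 1:
            marks.append("*")
        elif any(residues <= set(group) for group in SIMILAR_GROUPS):
            marks.append(":")
        else:
            marks.append(" ")
    return "".join(marks)


if __name__ == "__main__":
    # Two C-terminal fragments as printed in the strain listing above.
    variant = "MKEPGYVWQLLPNGMKPEYRP"
    common = "QKTTGYVWQLLPNGMKPEYRP"
    print(f"{percent_identity(variant, common):.1f}% identical")
    print(conservation_line([variant, common]))
```

The sketch assumes the fragments are already aligned to equal length; it performs no alignment itself, so full-length strain sequences would first have to be aligned with a dedicated tool.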



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs

ORF Primer Sequence Restriction sites

001 Forward CGCGGATCCCATATG-TGGATGGTGrTfyrrr at BamHI-NdeI
Reverse CCCGCTCGAG-TGCCGTCTTGTCCCAC XhoI
003 Forward CGCGGATCCCATATG-GTCGTATTCGTGGC BamHI-NdeI
Reverse CCCGCTCGAG-AAAATCATGAACACGCGC XhoI
005 Forward CGCGGATCCCATATG-GACAAT ATTO A C. A tot BamHI-NdeI
Reverse CCCGCTCGAG-CATCACATCCGCCCG XhoI
006 Forward CGCGGATCCCATATG-CTGCTGGTGCTGG BamHI-NdeI
Reverse CCCGCTCGAG-AGTTCCGGCTTTGATGT XhoI
007 Forward CGCGGATCCCATATG-GCCGACA AC. Anr Air at BamHI-NdeI
Reverse CCCGCTCGAG-AAGGCGTTCATGATATAAG XhoI
008 Forward CGCGGATCCCATATG-AACAACAGACATTTTG BamHI-NdeI
Reverse CCCGCTCGAG-CCTGTCCGGTAAAAGAC XhoI
009 Forward CGCGGATCCCATATG-CCCCGCGCTGCT BamHI-NdeI
Reverse CCCGCTCGAG-TGGCTTTTGCCACGTTTT XhoI
011 Forward CGCGGATCCCATATG-AAGACACA crnr A a n BamHI-NdeI
Reverse CCCGCTCGAG-GGCGGTCAGTACGGT XhoI
012 Forward CGCGGATCCCATATG-CTCGGmnTTfrPP BamHI-NdeI
Reverse CCCGCTCGAG-AGCGGGGAAGAGGCAC XhoI
013 Forward CGCGGATCCCATATG-CCTTTG AC.C A rarr BamHI-NdeI
Reverse CCCGCTCGAG-CTGATTCGGCAAAAAAATCT XhoI
018 Forward CGCGGATCCCATATG-CAGCAGAGGCAGTT BamHI-NdeI
Reverse CCCGCTCGAG-GACGAGGCGAACGCC XhoI
019 Forward AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC EcoRI
Reverse AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT PstI
023 Forward AAAGAATTC-AAAGAATATTCGGCATGGCAGGC EcoRI
Reverse AAACTGCAG-TTACCCCCAAATCACTTTAACTGA PstI
025 Forward AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC EcoRI
Reverse AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG PstI
031 Forward CGCGGATCCCATATG-GTCTCCCTTCGCTT BamHI-NdeI
Reverse CCCGCTCGAG-ATGTAAGACGGGGACAAC XhoI
032 Forward CGCGGATCCCATATG-CGGCGA AArciTrir BamHI-NdeI
Reverse CCCGCTCGAG-CTGGTTTTTTGATATTTGTG XhoI
033 Forward CGCGGATCCCATATG-GCGGCGGCAGACA BamHI-NdeI
Reverse CCCGCTCGAG-ATTTGCCGCATCCCGAT XhoI
034 Forward CGCGGATCCCATATG-GCCG AAA AC A GCTT A Can BamHI-NdeI
Reverse CCCGCTCGAG-TTTGACGATTTGGTTCAATT XhoI
036 Forward CGCGGATCCCATATG-CTGAAGCCGTGCG BamHI-NdeI
Reverse CCCGCTCGAG-CCGGACTGCGTATCGG XhoI
038 Forward CGCGGATCCCATATG-ACCGATTTCCGCC A BamHI-NdeI
Reverse CCCGCTCGAG-TTCTACGCCGTACTGCC XhoI
039 Forward CGCGGATCCCATATG-CCGTCCGAACCGC BamHI-NdeI
Reverse CCCGCTCGAG-TAGGATGACGAGGTAGG XhoI
041 Forward CGCGGATCCCATATG-TTCGTGCGCG A A CC.CIC. BamHI-NdeI
Reverse CCCGCTCGAG-GCCCAAAAACTCTTTCAAA XhoI
042 Forward CGCGGATCCCATATG-ACGATGATTTGCTTGC BamHI-NdeI
Reverse CCCGCTCGAG-TTTGCAGCCTGCATTTGAC XhoI
043 Forward AAAAAAGGTACC-ATGGTTGTTTCAAATCA A A AT ATP KpnI
Reverse AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC PstI
043a Forward AAAAAAGGTACC-GCAAAAGTGCATGGCGGCTTGGACGGTGC KpnI
Reverse AAAAAACTGCAG-TTAATCCTGCAACACGAATTCGCCCGTCCG PstI
044 Forward CGCGGATCCCATATG-CCGTCCGACTAGAG BamHI-NdeI
Reverse CCCGCTCGAG-ATGCGCTACGGTAGCCA XhoI
046 Forward AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG EcoRI
Reverse AAACTGCAG-TCACTCGGCGACCCACACCGTGAA PstI
047 Forward CGCGGATCCCATATG-GTCATCATACAGGCG BamHI-NdeI
Reverse CCCGCTCGAG-TCCGAAAAAGCCCATTTTG XhoI
048 Forward AAAGAATTC-ATGCTCAAC AAAGGCG A A G A ATTOrr EcoRI
Reverse AAACTGCAG-TCAAGATTCGACGGGGATGATGCC PstI
049 Forward AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC EcoRI
Reverse AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG PstI
050 Forward CGCGGATCCCATATG-GGCGCGGGCTGG BamHI-NdeI
Reverse CCCGCTCGAG-AATCGGGCCATCTTCGA XhoI
052 Forward AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC EcoRI
Reverse AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT SalI
052a Forward AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC EcoRI
Reverse AAAAAACTGCAG-TTAGCTGTTTTTGGAAACGCCGTCCAACCC PstI
073 Forward CGCGGATCCCATATG-TGTATGCCATATAAGAT BamHI-NdeI
Reverse CCCGCTCGAG-CACCGGATTGTCCGAC XhoI
075 Forward CGCGGATCCCATATG-CCGTCTT A CTTr A rr BamHI-NdeI
Reverse CCCGCTCGAG-ATCACCAATGCCGATTATTT XhoI
077a Forward AAAAAAGAATTC-GGCGGCATTTTCATCGACACCTTCCT EcoRI
Reverse AAAAAACTGCAG-TCAGACGAACATCTGCACAAACGCAAT PstI
080 Forward AAAGAATTC-GCGTCCGGGCTGGTTTGGTTTTACAATTC EcoRI
Reverse AAACTGCAG-CTATTCTTCGGATTCTTTTTCGGG PstI
081 Forward AAAGAATTC-ATGAAACCACTGGACCTAAATTTCATCTG EcoRI
Reverse AAACTGCAG-TCACTTATCCTCCAATGCCTC PstI
082 Forward AAAGAATTC-ATGTGGTTGTTGAAGTTGCCTGC EcoRI
Reverse AAACTGCAG-TTACGCGGATTCGGCAGTTGG PstI
084 Forward AAAGAATTC-TATCACCCAGAATATGAATACGGCTACCG EcoRI
Reverse AAACTGCAG-TTATACTTGGGCGCAACATGA PstI
085 Forward CGCGGATCCCATATG-GGTAAAGGGCAGGACT BamHI-NdeI
Reverse CCCGCTCGAG-CAAAGCCTTAAACGCTTCG XhoI
086 Forward AAAAAAGGTACC-TATTTGGCATCAAAAGAAGGCGG KpnI
Reverse AAACTGCAG-TTACTCCACCCGATAACCGCG PstI
087 Forward AAAGAATTC-ATGGGCGGTAAAACCTTTATGC EcoRI
Reverse AAACTGCAG-TTACGCCGCACACGCAATCGC PstI
087a Forward AAAAAAGAATTC-AAGCTATTAGGCGTGCCGATTGTGATTCA EcoRI
Reverse AAAAAACTGCAG-TTACGCCTGCAAGATGCCCAGCTTGCC PstI
088 Forward AAAAAAGAATTC-ATGTTTTTATGGCTCGCACATTTCAG EcoRI
Reverse AAAAAACTGCAG-TCAGCGGATTTTGAGGGTACTCAAACC PstI
089 Forward CGCGGATCCCATATG-CCGCCCAAAATCAC BamHI-NdeI
Reverse CCCGCTCGAG-TGCGCATACCAAAGCCA XhoI
090 Forward CGCGGATCCCATATG-CGCATAGTCGAGCA BamHI-NdeI
Reverse CCCGCTCGAG-AGCAAAACGGCGGTACG XhoI
091 Forward AAAGAATTC-ATGGAAATAGCCGTACCGCCGAGTCC EcoRI
Reverse AAACTGCAG-TCAGCGCAGGGGGTAGCCCAAGCC PstI
092 Forward AAAGAATTC-ATGTTTTTTATTTCAATCCG EcoRI
Reverse AAACTGCAG-TCAAATCTGTTTCGACAATGC PstI
093 Forward AAAGAATTC-ATGCAGAATTTTGGCAAAGTGGC EcoRI
Reverse AAACTGCAG-CTATGGCTCGTCATACCGGGC PstI
094 Forward AAAGAATTC-ATGCCGTCACGGAAGCGCATCAACTC EcoRI
Reverse AAACTGCAG-TTATCCCGGCCATACCGCCGAACA PstI
095 Forward AAAGAATTC-ATGTCCTTTCATTTGAACATGGACGG EcoRI
Reverse AAACTGCAG-TCAACGCCGCAGGCACTAACGCCC PstI
096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG EcoRI





WO 99/57280 PCT/US99/09346 




Reverse AAACTGCAG-TCAAAGGAAAAGGCCGTr.TPr A A A Anrn PstI
097 Forward AAAGAATTC-ATGGACACTTCAAAACAAACACTGTTG EcoRI
Reverse AAACTGCAG-TC AGCCC AAAT ACC A G A A TTTP A n PstI
098 Forward AAAGAATTC-GATGAACGCAGCCCAGCATGGATACG EcoRI
Reverse AAACTGCAG-TTACGACATTCTGATTTGGCA PstI
102 Forward AAAAAAGAATTC-GGCCTGATGATTTTGGAAGTCAACAC EcoRI
Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC PstI
105 Forward CGCGGATCCCATATG-TCCGC AAACG A AT A ("!G BamHI-NdeI
Reverse CCCGCTCGAG-GTGTTCTGCCAGTTTCAG XhoI
107 Forward AAAAAAGAATTC-CTGATGATTTTGGAAGTCAACACCCATTATCC EcoRI
Reverse AAAAAACTGCAG-TT ATCCTTTAA ATA GGGGH A rn A n-rrr PstI
107b Forward AAAAAAGAATTC-GATACCCAAGCCCCCGCCGGCACAAACTACTG EcoRI
Reverse AAAAAACTGCAG-TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC PstI
108 Forward AAAGAATTC-ATGTTGCCGGGCTTCAACCG EcoRI
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTG A A OP A PstI
108a Forward AAAAAAGAATTC-GGTAACACATTCGGCAGCTTAGACGGTGG EcoRI
Reverse AAACTGCAG-TT AGCGGT AC AGGTGTTTG A A HP A PstI
109 Forward AAAGAATTC-ATGTATTATCGCCGGGTTATGGG EcoRI
Reverse AAACTGCAG-CTAGCCCAAAGATTTG A A GTOTTP PstI
111 Forward CGCGGATCCCATATG-TGTTCGGAACA A ACCnr; BamHI-NdeI
Reverse CCCGCTCGAG-GCGGAGCAGTTTTTCAAA XhoI
114 Forward CGCGGATCCCATATG-GCTTCCATC A CTTC.nr BamHI-NdeI
Reverse CCCGCTCGAG-CATCCGCGAAATCGTC XhoI
117 Forward AAAAAAGGTACC-ATGGTCGAAGAACTGGAACTGCTG KpnI
Reverse AAACTGCAG-TT AAAGCCGGGTA A CGCTC A AT AP PstI
118 Forward AAAGTCGAC-ATGTGTGAGTTC AAGG A T ATT ATA Ar, SalI
Reverse AAAGCATGC-CTATTTTTTGTTGTAATAATCAAATC SphI
121 Forward CGCGGATCCCATATG-GAAAC AC AGCTTT APAT BamHI-NdeI
Reverse CCCGCTCGAG-ATAATAATATCCCGCGCCC XhoI
122 Forward CGCGGATCCCATATG-GTCATGATTA A A A TC.r.ac. A BamHI-NdeI
Reverse CCCGCTCGAG-AATCTTGGTAGATTGGATTT XhoI
125 Forward AAAGAATTC-ATGTCGGGCAATGCCTCCTCTCC EcoRI
Reverse AAACTGCAG-TC ACGCCGTTTCAAGACG PstI
125a Forward AAAAAAGAATTC-ACGGCAGGCAGCACCGCCGCACAGGTTTC EcoRI
Reverse AAAAAACTGCAG-TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC PstI
126 Forward CGCGGATCCCATATG-CCGTCTGAAACCC BamHI-NdeI
Reverse CCCGCTCGAG-ATATTCCGCCGAATGCC XhoI
127 Forward AAAGAATTC-ATGGAAATATGGAATATGTTGGACACTTG EcoRI
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI
127a Forward AAAAAAGAATTC-AAGGAACTGATTATGTGTCTGTCGGG EcoRI
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI
128 Forward CGCGGATCCCATATG-ACTGACAACGCACT BamHI-NdeI
Reverse CCCGCTCGAG-GACCGCGTTGTCGAAA XhoI
130 Forward CGCGGATCCCATATG-AAACAArTPPr T rriA BamHI-NdeI
Reverse CCCGCTCGAG-GAATTTTGCACCGGATTG XhoI
132 Forward AAAGAATTC-ATGGAACCCTTCAAAACCTTAATTTG EcoRI
Reverse AAAAAACTGCAG-TCACCATGTCGGCATTTGAAAAAC PstI
134 Forward CGCGGATCCCATATG-TCCCA AfiA A ATrrrr BamHI-NdeI
Reverse CCCGCTCGAG-CAGTTTGACCGAATGTTC XhoI
135 Forward CGCGGATCCCATATG-AAATAHA AAAfiA atph TA tt BamHI-NdeI
Reverse CCCGCTCGAG-AAATTCGGTCAGAAGCAGG XhoI
137 Forward AAAAAAGGTACC-ATGATTACCCATCCCCAATTCGATCC KpnI
Reverse AAAAAACTGCAG-TCAGTGCTGTTTTTTCATGCCGAA PstI
137a Forward AAAAAAGAATTC-GGCCGCAAACACGGCATCGGCTTCCT EcoRI
Reverse AAAAAACTGCAG-TTAAGCGGGATGACGCGGCAGCATACC PstI
138 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI
Reverse AAAAAATCTAGA-TCAGTTTAGGGATAGCAGGCGTAC XbaI
141 Forward AAAGAATTC-ATGAGCTTCAAAACCGATGCCGAAATCGC EcoRI
Reverse AAACTGCAG-TCAGAACAAGCCGTGAATCACGCC PstI
142 Forward CGCGGATCCCATATG-CGTGCCGATTTCATG BamHI-NdeI
Reverse CCCGCTCGAG-AAACTGCTGCACATGGG XhoI
143 Forward AAAAAAGAATTC-ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC EcoRI
Reverse AAAAAACTGCAG-TCAAACCCCGCCGTGTGTTTCTTTAAT PstI
144 Forward AAAAAAGAATTC-GGTCTGATCGACGGGCGTGCCGTAAC EcoRI
Reverse AAAAAATCTAGA-TCGGCATCGGCCGGCATATGTCCG XbaI
146 Forward AAAAAAGAATTC-CGCCAAGTCGTCATTGACCACGACAAAGTC EcoRI
Reverse AAAAAACTGCAG-TTAGGCATCGGCAAATAGGAAACTGGG PstI
147 Forward AAAAAAGAATTC-ACTGAGCAATCGGTGGATTTGGAAAC EcoRI
Reverse AAAAAATCTAGA-TTAGGTAAAGCTGCGGCCCATTTGCGG XbaI
148 Forward AAAAAAGAATTC-ATGGCGTTAAAAACATCAAACTTGGAACACGC EcoRI
Reverse AAAAAATCTAGA-TCAGCCCTTCATACAGCCTTCGTTTTG XbaI
149 Forward CGCGGATCCCATATG-CTGCTTr; apa An a a a^.-t BamHI-NdeI
Reverse CCCGCTCGAG-AAACTTCACGTTCACGCC XhoI
150 Forward CGCGGATCCCATATG-CAGAACACAAATCCG BamHI-NdeI
Reverse CCCGCTCGAG-ATAAACATCACGCTGATAGC XhoI
151 Forward AAAAAAGAATTC-ATGAAACAAATCCGCAACATCGCCATCATCGC EcoRI
Reverse AAAAAACTGCAG-TCAATCCAGCTTTTTAAAGTGGCGGCG PstI
152 Forward AAAAAAGAATTC-ATGAAAAACAAAACCAAAGTCTGGGACCTCCC EcoRI
Reverse AAAAAACTGCAG-TCAGGACAGGAGCAGGATGGCGGC PstI
153 Forward AAAAAAGAATTC-ATGGCGTTTGCTTACGGTATGAC EcoRI
Reverse AAAAAACTGCAG-TCAGTCATGTTTTTCCGTTTCATT PstI
153a Forward AAAAAAGAATTC-CGGACTTCGGTATCGGTTCCCCAGCATTG EcoRI
Reverse AAAAAACTGCAG-TTACGCCGACGAAATACTCAGACTTTTCGG PstI
154 Forward CGCGGATCCCATATG-ACTGACAACAGCCC BamHI-NdeI
Reverse CCCGCTCGAG-TCGGCTTCCTTTCGGG XhoI
155 Forward AAAAAAGAATTC-ATGAAAATCGGTATCCCACGCGAGTC EcoRI
Reverse AAAAAACTGCAG-TTACCCTTTCTTAAACATATTCAGCAT PstI
156 Forward AAAAAAGAATTC-GC ACAGCAAAACGGTTTTGA AGC EcoRI
Reverse AAAAAACTGCAG-TCAAGCAGCCGCGACAAACAGCCC PstI
157 Forward CGCGGATCCCATATG-AGGAACGAGGAAAAAC BamHI-NdeI
Reverse CCCGCTCGAG-AAAACACAATATCCCCGC XhoI
158 Forward AAAAAAGAATTC-GCGGAGCAGTTGGCGATGGCAAATTCTGC EcoRI
Reverse AAAAAATCTAGA-TTATCCACAGAGATTGTTTCCCAGTTC XbaI
160 Forward CGCGGATCCCATATG-GAC ATTCTGG A C A A A C. BamHI-NdeI
Reverse CCCGCTCGAG-TTTTTGCCCGCCTTCTTT XhoI
163 Forward AAAAAAGGTACC-ACCGTGCCGGATCAGGTGCAGATGTG KpnI
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG XbaI
163a Forward AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC EcoRI
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG XbaI
164 Forward CGCGGATCCCATATG-AACCGGACTTATGCC BamHI-NdeI
Reverse CCCGCTCGAG-TTTGTTTCCGTCAAACTGC XhoI
165 Forward CGCGGATCCGCTAGC-GCTGAAGCGACAGACG BamHI-NheI
Reverse CCCGCTCGAG-AATATCCAATACTTTCGCG XhoI
206 Forward CGCGGATCCCATATG-AAACACCGCCAAGCGA BamHI-NdeI
Reverse CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC XhoI
209 Forward CGCGGATCCCATATG-CTGCGGCATTTAGGA BamHI-NdeI
Reverse CCCGCTCGAG-TACCCCTGAAGGCAAC XhoI
211 Forward AAAAAAGAATTC-ATGTTGCGGGTTGCTGCTGC EcoRI
Reverse AAAAAACTGCAG-CTATCCTGCGGATTGGCATTGAAA PstI
212 Forward CGCGGATCCCATATG-GACAATCTCGTATGG BamHI-NdeI
Reverse CCCGCTCGAG-AGGGGTTAGATCCTTCC XhoI
215 Forward CGCGGATCCCATATG-GCATGGTTGGnTf'riT BamHI-NdeI
Reverse CCCGCTCGAG-CATATCTTTTGTATCATAAATC XhoI
216 Forward CGCGGATCCCATATG-GCAATGGCAGAAAACG BamHI-NdeI
Reverse CCCGCTCGAG-TACAATCCGTGCCGCC XhoI
217 Forward CGCGGATCCCATATG-GCGGATGACGGTGTG BamHI-NdeI
Reverse CCCGCTCGAG-ACCCCGAATATCGAATCC XhoI
218 Forward CGCGGATCCCATATG-GTCGCGGTCGATC BamHI-NdeI
Reverse CCCGCTCGAG-TAACTCATAGAATCCTGC XhoI
219 Forward CGCGGATCCGCTAGC-ACGGCAAGGTTAAG BamHI-NheI
Reverse CCCGCTCGAG-TTTAAACCATCTCCTCAAAAC XhoI
223 Forward CGCGGATCCCATATG-GAATTCAGGCACCAAGTA BamHI-NdeI
Reverse CCCGCTCGAG-GGCTTCCCGCGTGTC XhoI
225 Forward CGCGGATCCCATATG-GACGAGTTGACCAACC BamHI-NdeI
Reverse CCCGCTCGAG-GTTCAGAAAGCGGGAC XhoI
226 Forward AAAGAATTC-CTTGCGATTATCGTGCGCACGCG EcoRI
Reverse AAACTGCAG-TCAAAATCCCAAAACGGGGAT PstI
228 Forward CGCGGATCCCATATG-TCGCAAGAAGCCAAACAG BamHI-NdeI
Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT XhoI
229 Forward CGCGGATCCCATATG-CA AG A CjCiTTTTncTr BamHI-NdeI
Reverse CCCGCTCGAG-ACACAATATAGCGGATGAAC XhoI
230 Forward CGCGGATCCCATATG-CA TCCGanrnrm a r BamHI-NdeI
Reverse CCCGCTCGAG-AAGTTTGGCGGCTTCGG XhoI
232 Forward AAAAAAGAATTC-ATGTACGCTAAAAAAGGCGGTTTGGG EcoRI
Reverse AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC PstI
232a Forward AAAAAAGAATTC-GCCAAGGCTGCCGATACACAAATTGA EcoRI
Reverse AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG PstI
233 Forward CGCGGATCCCATATG-GCGGACAAACCCAAG BamHI-NdeI
Reverse CCCGCTCGAG-GACGGCATTGAGCAG XhoI
234 Forward CGCGGATCCCATATG-GCCGTTTr; A era a rnn BamHI-NdeI
Reverse GCCCAAGCTT-ACGGTTGGATTGCCATG HindIII
235 Forward CGCGGATCCCATATG-GCCTGCC AAGTTC A A A BamHI-NdeI
Reverse CCCGCTCGAG-TTTGGGCTGCTCTTC XhoI
236 Forward CGCGGATCCCATATG-GCGCGTTTCGCCTT BamHI-NdeI
Reverse CCCGCTCGAG-ATGGGTCGCGCGCCGT XhoI
238 Forward CGCGGATCCGCTAGC-AACGGTTTGGATGCCrn BamHI-NheI
Reverse CCCGCTCGAG-TTTGTCTAAGTTCCTGATATG XhoI
239 Forward CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG EcoRI-NdeI
Reverse CCCGCTCGAG-TGGTGAAG AGCGGTTTA G XhoI
240 Forward CGCGGATCCCATATG-GACGTTGGACGATTTG BamHI-NdeI
Reverse CCCGCTCGAG-AAACGCCATTACCCGATG XhoI
241 Forward CCGGAATTCTACATATG-CCAACACGTCCAACT EcoRI-NdeI
Reverse CCCGCTCGAG-GAATGCGCCTGTAATTAATC XhoI
242 Forward CGCGGATCCCATATG-ATCGGCAAACTTGTTn BamHI-NdeI
Reverse GCCCAAGCTT-ACCGATACGGTCGCAG HindIII
243 Forward CGCGGATCCCATATG-ACGATTTTTTCGATGGTGC BamHI-NdeI
Reverse CCCGCTCGAG-CGACTTGGTTACCGCG XhoI
244 Forward CGCGGATCCCATATG-CCGTCTGAAGCCC BamHI-NdeI
Reverse CCCGCTCGAG-TTTTTTCGGTAGGGGATTT XhoI
246 Forward CGCGGATCCCATATG-GACATCGGCAGTGC BamHI-NdeI
Reverse CCCGCTCGAG-CCCGCGCTGCTGGAG XhoI
247 Forward CGCGGATCCCATATG-GTCGGATCGAGTT A C. BamHI-NdeI
Reverse CCCGCTCGAG-AAGTGTTCTGTTTGCGCA XhoI
248 Forward CGCGGATCCCATATG-CGCAAACAGA AC. ACT BamHI-NdeI
Reverse CCCGCTCGAG-CTCATCATTATTGCTAACA XhoI
249 Forward CGCGGATCCCATATG-AAGAATAATGATTGHTTP BamHI-NdeI
Reverse CCCGCTCGAG-TTCCCGACCTCCGAC XhoI
251 Forward CGCGGATCCCATATG-CGTGCTGCGGT A GT BamHI-NdeI
Reverse CCCGCTCGAG-TACGAAAGCCGGTCGTG XhoI
253 Forward AAAAAAGAATTC-ATGATTGACAGGAACCGTATGCTGCG EcoRI
Reverse AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG PstI
253a Forward AAAAAAGAATTC-AAAATCCTTTTGAAAACAAGCGAAAACGG EcoRI
Reverse AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG PstI
254 Forward AAAAAAGAATTC-ATGTATACAGGCGAACGCTTCAATAC EcoRI
Reverse AAAAAATCTAGA-TCAGATTACGTAACCGTACACGCTGAC XbaI
255 Forward CGCGGATCCCATATG-GCCGCGTTGCGTTAC BamHI-NdeI
Reverse CCCGCTCGAG-ATCCGCAATACCGACCAG XhoI
256 Forward CGCGGATCCGCTAGC-TTTTAACACCGCCGGAC BamHI-NheI
Reverse CCCGCTCGAG-ACGCCTGTTTGTGCGG XhoI
257 Forward CGCGGATCCCATATG-GCGGTTTCTTTrrTO BamHI-NdeI
Reverse CCCGCTCGAG-GCGCGTGAATATCGCG XhoI
258 Forward AAAAAAGAATTC-GATTATTTCTGGTGGATTGTTGCGTTCAG EcoRI
Reverse AAAAAACTGCAG-CTACGCATAAGTTTTTACCGTTTTTGG PstI
258a Forward AAAAAAGAATTC-GCGAAGGCGGTGGCGCAAGGCGA EcoRI
Reverse AAAAAACTGCAG-CTACGCATAAGTTTTTACCGTTTTTGG PstI
259 Forward CGCGGATCCCATATG-GAAGAGCTGCCTCCG BamHI-NdeI
Reverse CCCGCTCGAG-GGCTTTTCCGGCGTTT XhoI
260 Forward CGCGGATCCCATATG-GGTGCGGGTATGGT BamHI-NdeI
Reverse CCCGCTCGAG-AACAGGGCGACACCCT XhoI
261 Forward AAAAAAGAATTC-CAAGATACAGCTCGGGCATTCGC EcoRI
Reverse AAAAAACTGCAG-TCAAACCAACAAGCCTTGGTCACT PstI
263 Forward CGCGGATCCCATATG-GCACGTTTAAHrnTA BamHI-NdeI
Reverse CCCGCTCGAG-GGCGTAAGCCTGCAATT XhoI
264 Forward AAAAAAGGTACC-GCCGACGCAGTGGTCAAGGCAGAA KpnI
Reverse AAACTGCAG-TCAGCCGGCGGTCAATACCGCCCG PstI
265 Forward AAAAAAGAATTC-GCGGAGGTCAAGAGAAGGTGTTTG EcoRI
Reverse AAAAAACTGCAG-TTACGAATACGTCGTCAAAATGGG PstI
266 Forward AAAGAATTC-CTCATCTTTGCCAACGCCCCCTTC EcoRI
Reverse AAACTGCAG-CTATTCCCTGTTGCGCGTGTGCCA PstI
267 Forward AAAGAATTC-TTCTTCCGATTCGATGTTA Arm EcoRI
Reverse AAACTGCAG-TTAGTAAAAACCTTTCTGCTTGGC PstI
269 Forward AAAGAATTC-TGCAAACCTTGCGCCACGTGCCC EcoRI
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI
269a Forward AAAAAAGAATTC-GACTTTATCCAAAACACGGCTTCGCC EcoRI
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI
270 Forward AAAGAATTC-GCCGTCAAGCTCGTTTTGTTGCAATG EcoRI
Reverse AAACTGCAG-TTATTCGGCGGTAAATGCCGTCTG PstI
271 Forward CGCGGATCCCATATG-CCTGTGTGCAGCTCGAC BamHI-NdeI
Reverse CCCGCTCGAG-TCCCAGCCCCGTGGAG XhoI
272 Forward AAAGAATTC-ATGACCGCAAAGGAAGAACTGTTCGC EcoRI
Reverse AAACTGCAG-TCAGAGCAGTTCCAAATCGGGGCT PstI
273 Forward AAAGAATTC-ATGAGTCTTCAGGCGGTATTTATATACCC EcoRI
Reverse AAACTGCAG-TTACGCGTAAGAAAAAACTGC PstI
274 Forward CGCGGATCCCATATG-ACAGATTTGGTTAnGGAr BamHI-NdeI
Reverse CCCGCTCGAG-TTTGCTTTCAGTATTATTG A A XhoI
276 Forward AAAAAAGAATTC-ATGATTTTGCCGTCGTCCATCACGATGATGCG EcoRI
Reverse AAAAAACTGCAG-CT AC ACC ACC ATCGGCG A A TTT A Tnnr PstI
277 Forward AAAAAAGAATTC-ATGCCGCGCTTTGAGGACAAGCTCGTAGG EcoRI
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI
277a Forward AAAAAAGAATTC-GGGGCGGCGGCTGGGTTGGACGTAGG EcoRI
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI
278 Forward AAAAAAGGTACC-GTCAAAGTTGTATTAATCGGGCCTTTGCC KpnI
Reverse AAAAAACTGCAG-TCATTCAACC AT ATC A A A TCTnr.C PstI
278a Forward AAAAAAGAATTC-AAAACTCTCCTAATTCGTCATAGTCG EcoRI
Reverse AAAAAACTGCAG-TCATTCAACC ATATC AAA TCTnfT PstI
279 Forward CGCGGATCCCATATG-TTGCCTGCAATCACGATT BamHI-NdeI
Reverse CCCGCTCGAG-TTT AGAAGCGGGCGGC A A XhoI
280 Forward AAAAAAGGTACC-GCCCCCCTGCCGGTTGTAACCAG KpnI
Reverse AAAAAACTGCAG-TTATTGCTTCATCGCGTTGGTCAAGGC PstI
281 Forward AAAAAAGAATTC-GCACCCGTCGGCGTATTCCTCGTCATGCG EcoRI
Reverse AAAAAATCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG XbaI
281a Forward AAAAAAGAATTC-TCCTACCACATCGAAATTCCTTCCGG EcoRI
Reverse AAAAAATCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG XbaI
282 Forward AAAAAAGAATTC-CTTTACCTTGACCTGACCAACGGGCACAG EcoRI
Reverse AAAAAACTGCAG-TCAACCTGCCAGTTGCGGGAATATCGT PstI
283 Forward CGCGGATCCCATATG-GCCGTCTTTACTTGGAAG BamHI-NdeI
Reverse CCCGCTCGAG-ACGGCAGTATTTGTTTACG XhoI
284 Forward CGCGGATCCCATATG-TTTGCCTGCAAAAGAATPO BamHI-NdeI
Reverse CCCGCTCGAG-CCGACTTTGCAAAAACTG XhoI
286 Forward CGCGGATCCCATATG-GCCGACCTTTCCGA AAA BamHI-NdeI
Reverse CCCGCTCGAG-GAAGCGCGTTCCCAAG XhoI
287 Forward CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG EcoRI-NheI
Reverse CCCGCTCGAG-ATCCTGCTCTTTTTTGCC XhoI
288 Forward CGCGGATCCCATATG-CACACCGGACAGfi BamHI-NdeI
Reverse CCCGCTCGAG-CGTATCAAAGACTTGCGT XhoI
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA BamHI-NdeI
Reverse CCCGCTCGAG-TCGGCGCGGCGGGC XhoI
292 Forward CGCGGATCCCATATG-TGCGGGCAAACGCCC BamHI-NdeI
Reverse CCCGCTCGAG-TTGATTTTTGCGGATGATTT XhoI
294 Forward AAAAAAGAATTC-GTCTGGTCGATTCGGGTTGTCAGAAC EcoRI
Reverse AAAAAACTGCAG-TTACCAGCTGATATAAAACATCGCTTT PstI
295 Forward CGCGGATCCCATATG-AACCGGCCGGmTrr BamHI-NdeI
Reverse CCCGCTCGAG-CGATATTTGATTCCGTTGC XhoI
297 Forward AAAAAAGAATTC-GCATACATTGCTTCGACAGAGAG EcoRI
Reverse AAAAAACTGCAG-TCAATCCGATTGCGACACGGT PstI
298 Forward AAAAAAGAATTC-CTGATTGCCGTGTGGTTCAGCCAAAACCC EcoRI
Reverse AAAAAACTGCAG-TCATGGCTGTGTACTTGATGGTTGCGT PstI
299 Forward CGCGGATCCGCTAGC-CTACCTGTCGrrTrro BamHI-NheI
Reverse CCCGCTCGAG-TTGCCTGATTGCAGCGG XhoI
302 Forward AAAAAAGAATTC-ATGAGTCAAACCGATACGCAACG EcoRI
Reverse AAAAAACTGCAG-TTAAGGTGCGGGATAGAATGTGGGCGC PstI
305 Forward AAAAAAGGTACC-GAATTTTTACCGATTTCCAGCACCGGA KpnI
Reverse AAAAAACTGCAG-TCATTCCCAACTTATCCAGCCTGACAG PstI
305a Forward AAAAAAGGTACC-TCCCGTTCGGGCAGTACGATTATGGG KpnI
Reverse AAAAAACTGCAG-TTACAAACCGACATCATGCAGGGTGAA PstI
306 Forward CGCGGATCCCATATG-TTTATGAACAAATTTTCCC BamHI-NdeI
Reverse CCCGCTCGAG-CCGCATCGGCAGAC XhoI
308 Forward CGCGGATCCCATATG-TTAAATCGGGTATTTTATr- BamHI-NdeI
Reverse CCCGCTCGAG-ATCCGCCATTCCCTGC XhoI
311 Forward AAAAAAGGTACC-ATGTTCAGTTTTGGCTGGGTGTTT KpnI
Reverse AAACTGCAG-ATGTTCATATTCCCTGCCTTCGGC PstI
312 Forward AAAAAAGGTACC-ATGAGTATCCCATCCGGCGAAATT KpnI
Reverse AAACTGCAG-TCAGTTTTTCATCGATTGAACCGG PstI
313 Forward AAAAAAGAATTC-ATGGACGACCCGCGCACCTACGGATC EcoRI
Reverse AAAAAACTGCAG-TCAGCGGCTGCCGCCGATTTTGCT PstI
401 Forward CGCGGATCCCATATG-AAGGCGGCAACACAGC BamHI-NdeI
Reverse CCCGCTCGAG-CCTTACGTTTTTCAAAGCC XhoI
402 Forward AAAAAAGAATTC-GTGCCTCAGGCATTTTCATTTACCCTTGC EcoRI
Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC XbaI
402a Forward AAAAAAGAATTC-AGGCTGATTGAAAACAAACACGG EcoRI
Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC XbaI
406 Forward CGCGGATCCCATATG-TGCGGGACACTGACAG BamHI-NdeI
Reverse CCCGCTCGAG-AGGTTGTCCTTOTr'T A Tn XhoI
501 Forward CGCGGATCCCATATG-GC AGGCGG AGATGGC BamHI-NdeI
Reverse CCCGCTCGAG-GGTGTGATGTTCACCC XhoI
Forward CGCGGATCCCATATG-GTAGACGCGCTTAAGCA BamHI-NdeI
Reverse CCCGCTCGAG-AGCTGCATGGCGGCG XhoI
Forward CGCGGATCCCATATG-TGTTCGGGGAAAGGCG BamHI-NdeI
Reverse CCCGCTCGAG-CCGCGCATTCCTCGCA XhoI
504 Forward CGCGGATCCCATATG-AGCGATATTGAAGTGACG BamHI-NdeI
Reverse GCCCAAGCTT-TGATTCAAGTCCTTGCCG HindIII
505 Forward CGCGGATCCCATATG-TTTCGTTTACAATTCAGG BamHI-NdeI
Reverse CCCGCTCGAG-CGGCGTTTTATAGCGG XhoI
510 Forward CGCGGATCCCATATG-CCTTCGCGGACAC BamHI-NdeI
Reverse CCCGCTCGAG-GCGCACTGGCAGCG XhoI
512 Forward CGCGGATCCCATATG-GGACATGAAGTAACGOT BamHI-NdeI
Reverse CCCGCTCGAG-AGGAATAGCCTTTGACG XhoI
515 Forward CGCGGATCCCATATG-GAGGAAATAGCCTTCGA BamHI-NdeI
Reverse CCCGCTCGAG-AAATGCCGCAAAGCATC XhoI
516 Forward CGCGGATCCCATATG-TGTACGTTGATGTTGTGG BamHI-NdeI
XhoI
517 Forward CGCGGATCCCATATG-GGTAAAGGTGTGG A A ATA BamHI-NdeI
Reverse CCCGCTCGAG-GTGPOPPr* A nc^nnT XhoI
518 Forward AAAGAATTC-GCTTTTTTACTGCTCCGACCGG A A Gfi EcoRI
Reverse AAACTGCAG-TCAAATTTCAGACTCTGCCAC PstI
519 Forward CGCGGATCCCATATG-TTC AAATCCTTTGTCGTC A BamHI-NdeI
Reverse CCCGCTCGAG-TTTGGCGGTTTTGCTGC XhoI
520 Forward CGCGGATCCCATATG-CCTGCGCTTCTTTC A BamHI-NdeI
Reverse CCCGCTCGAG-ATATTTACATTTCAGTCGGC XhoI
521 Forward CGCGGATCCCATATG-GCCAAAATCTATACCTGr: BamHI-NdeI
Reverse CCCGCTCGAG-CATACGCCCCAGTTCC XhoI
522 Forward CGCGGATCCCATATG-ACTGAGCCGAAACAC BamHI-NdeI
Reverse GCCCAAGCTT-TTCTGATTTC AAATCGGC A HindIII
523 Forward CGCGGATCCCATATG-GCTCTGCTTTCnGCG BamHI-NdeI
Reverse CCCGCTCGAG-AGGGTGTGTGATAATAAGAAG XhoI
525 Forward CGCGGATCCCATATG-GCCGA A ATGOTTP a aatp BamHI-NdeI
Reverse CCCGCTCGAG-GCCCGTGCATATCATAAA XhoI
527 Forward AAAGAATTC-TTCCCTCAATGTTGCCGTTTTCG EcoRI
Reverse AAACTGCAG-TTATGCTAAACTCGAAACAAATTC PstI
529 Forward CGCGGATCCGCTAGC-TGCTCCGGCAGCAAAAC BamHI-NheI
Reverse GCCCAAGCTT-ACGC AGTTCGG A A Tflfi A n HindIII
530 Forward CGCGGATCCCATATG-AGTGCGAGCGCGG BamHI-NdeI
Reverse CCCGCTCGAG-ACGACCGACTGATTCCG XhoI
531 Forward AAAAAAGAATTC-TATGCCGCCGCCTACCAAATCTACGG EcoRI
Reverse AAAAAACTGCAG-TTAAAACAGCGCCGTGCCGACGACAAG PstI
532 Forward AAAAAAGAATTC-ATGAGCGGTCAGTTGGGCAAAGGTGC EcoRI
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA PstI
532a Forward AAAAAAGAATTC-TTGGGTGTCGCGTTTGAGCCGGAAGT EcoRI
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA PstI
535 Forward AAAGAATTC-ATGCCCTTTCCCGTTTTCAGAC EcoRI
Reverse AAACTGCAG-TCAGACGACCCCGCCTTCCCC PstI
537 Forward CGCGGATCCCATATG-CATACCCAAAACCAATCC BamHI-NdeI
Reverse CCCGCTCGAG-ATCCTGCAAATAAAGGGTT XhoI
538 Forward CGCGGATCCCATATG-GTCGAGCTGGTCAAAGC BamHI-NdeI
Reverse CCCGCTCGAG-TGGCATTTCGGTTTCGTC XhoI
539 Forward CGCGGATCCGCTAGC-GAGGATTTGCAGGAAA BamHI-NheI
Reverse CCCGCTCGAG-TACCAATGTCGGCAAATC XhoI
542 Forward AAAGAATTC-ATGCCGTCTGAAACCGTGTC EcoRI
Reverse AAACTGCAG-TTACCGCGAACCGGTCAGGAT PstI
543 Forward AAAAAAGAATTC-GCCTTCGATGGCGACGTTGTAGGTAC EcoRI
Reverse AAAAAATCTAGA-TTAATGAAGAAGAACATATTGGAATTTTGG XbaI
543a Forward AAAAAAGAATTC-GGCAAAACTCGTCATGAATTTGC EcoRI
Reverse AAAAAATCTAGA-TTAATGAAGAAGAACATATTGGAATTTTGG XbaI
544 Forward AAAGAATTC-GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG EcoRI
Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT PstI
544a Forward AAAAAAGAATTC-GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC EcoRI
Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT PstI
547 Forward AAAGAATTC-ATGTTCGTAGATAACGGATTTAATAAAAC EcoRI
Reverse AAACTGCAG-TTAACAACAAAAAACAAACCGCTT PstI
548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC EcoRI
Reverse AAACTGCAG-TCAGAGCAGGGTCCTTACATGGGH PstI
550 Forward AAAAAAGTCGAC-ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC SalI
Reverse AAACTGCAG-TTACGC AAACGCTGC AA A A TC.C.C.C PstI
550a Forward AAAAAAGAATTC-GTAAATCACGCCTTTGGAGTCGCAAACGG EcoRI
Reverse AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC PstI
552 Forward AAAAAAGAATTC-TTGGCGCGTTGGCTGG AT A C. EcoRI
Reverse AAACTGCAG-TTATTTCTGATGCCTTTTCCCAAC PstI
554 Forward CGCGGATCCCATATG-TCGCCCGCGCCC AAf BamHI-NdeI
Reverse CCCGCTCGAG-CTGCCCTGTCAGACAC XhoI
556 Forward AAAGAATTC-GCGGGCGGTTTTGTTTGGACATCCCG EcoRI
Reverse AAACTGCAG-TTAACGGTGCGGACGTTTCTGACC PstI
557 Forward CGCGGATCCCATATG-TGCGGTTTCCACCTGAA BamHI-NdeI
Reverse CCCGCTCGAG-TTCCGCCTTCAGAAAGG XhoI
558 Forward AAAGAATTC-GAGCTTTATATGTTTCAACAGGGGACGGC EcoRI
Reverse AAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA PstI
558a Forward AAAAAAGAATTC-ATTAGATTCTATCGCCATAAACAGACGGG EcoRI
Reverse AAAAAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA PstI
560 Forward AAAAAAGAATTC-TCGCCTTTCCGGGACGGGGCGCACAAGATGGC EcoRI
Reverse AAAAAACTGCAG-TCATGCGGTTTCAGACGGCATTTTGGC PstI
561 Forward CCGGAATTCTACATATG-ATACTGCCAGCCCGT EcoRI-NdeI
Reverse CCCGCTCGAG-TTTCAAGCTTTCTTCAGATG XhoI
562 Forward CGCGGATCCCATATG-GCAAGCCCGTCGAG BamHI-NdeI
Reverse CCCGCTCGAG-AGACCAACTCCAACTCGT XhoI
565 Forward CGCGGATCCCATATG-AAGTCGAGCGCG A AATAf BamHI-NdeI
Reverse CCCGCTCGAG-GGCATTGATCGGCGGC XhoI
566 Forward CGCGGATCCCATATG-GTCGGTGGCGAAGAGO BamHI-NdeI
Reverse CCCGCTCGAG-CGCATGGGCGAAGTCA XhoI
567 Forward CCGGAATTCTACATATG-AGTGCGAACATCCTTG EcoRI-NdeI
Reverse CCCGCTCGAG-TTTCCCCGACACCCTCG XhoI
568 Forward CGCGGATCCCATATG-CTCAGGGTC AGA C.C. BamHI-NdeI
Reverse CCCGCTCGAG-CGGCGCGGCGTTCAG XhoI
569 Forward AAAAAAGAATTC-CTGATTGCCTTGTGGGAATATGCCCG EcoRI
Reverse AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT PstI
570 Forward CGCGGATCCCATATG-GACACCTTCCAAAAAATCG BamHI-NdeI
Reverse CCCGCTCGAG-GCGGGCGTTCATTTCTTT XhoI
571 Forward AAAAAAGAATTC-ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC EcoRI
Reverse AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG PstI

572 Forward CGCGGATCCCATATG-GCGC A AAA Aanr a a a Arr BamHI-NdeI
Reverse CCCGCTCGAG-GCGCAGTGTGCCGATA XhoI

573 Forward CGCGGATCCCATATG-CCCTrTTTTnTorrr; BamHI-NdeI
Reverse CCCGCTCGAG-GACGGTGTCATTTCGCC XhoI

574 Forward CGCGGATCCCATATG-TGGTTTGCCGCCCGC BamHI-NdeI
Reverse CCCGCTCGAG-AACTTCGATTTTATTCGGG XhoI

575 Forward CGCGGATCCCATATG-GTTTCGGGCGAGG BamHI-NdeI
Reverse CCCGCTCGAG-CATTCCGAATCTGAACAG XhoI

576 Forward CGCGGATCCCATATG-GCCGCrcrrar a ty-t BamHI-NdeI
Reverse CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC XhoI

577 Forward CGCGGATCCCATATG-GAA AGG A A ranTn-r a ttt BamHI-NdeI
Reverse CCCGCTCGAG-AGGCTGTTTGGTAGATTCG XhoI

578 Forward CGCGGATCCCATATG-AGAAGGTTCGTACAG BamHI-NdeI
Reverse CCCGCTCGAG-GCCAACGCCTCCACG XhoI

579 Forward CGCGGATCCCATATG-AGATTGGGCGTTTCCAC BamHI-NdeI
Reverse CCCGCTCGAG-AGAATTGATGATGTGTATGT XhoI

580 Forward CGCGGATCCCATATG-AGGCAGACTTCGCCGA BamHI-NdeI
Reverse CCCGCTCGAG-CACTTCCCCCGAAGTG XhoI

581 Forward CGCGGATCCCATATG-CACTTCGCCCAGC BamHI-NdeI
Reverse CCCGCTCGAG-CGCCGTTTGGCTTTGG XhoI

582 Forward AAAAAAGAATTC-TTTGGAGAGACCGCGCTGCAATGCGC EcoRI
Reverse AAAAAATCTAGA-TCAGATGCCGTCCCAGTCGTTGAA XbaI

583 Forward AAAAAAGAATTC-ACTGCCGGCAATCGACTGCATAATCG EcoRI
Reverse AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG PstI

584 Forward AAAAAAGAATTC-GCGGCTGAAGCATTGAATTACAATATTGTC EcoRI
Reverse AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT PstI

585 Forward AAAAAAGGTACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC EcoRI
Reverse AAAAAACTGCAG-TCAGTTCGCACTTTTTTCTGTTTTGGA PstI

586 Forward CGCGGATCCCATATG-GCAGCCCATCTCG BamHI-NdeI
Reverse CCCGCTCGAG-TTTCAGCGAATCAAGTTTC XhoI

587 Forward CGCGGATCCCATATG-GACCTGrrrTTr; a rr. a BamHI-
NdeI
Reverse CCCGCTCGAG-AAATGTATGCTGTACGCC XhoI

588 Forward AAAAAAGAATTC-GCCGTCCTGACTTCCTATCAAGAACCAGG EcoRI
Reverse AAAAAACTGCAG-TTATTTGTTTTTGGGCAGTTTCACTTC PstI

589 Forward AAAAAAGAATTC-ATGCAACAAAAAATCCGTTTCCAAATCGAAGG EcoRI
Reverse AAAAAACTGCAG-CTAATCGATTTTTACCCGTTTCAGGCG PstI

590 Forward AAAAAAGAATTC-ATGAAAAAACCTTTGATTTCAGTTGCGGC EcoRI
Reverse AAAAAACTGCAG-TTACTGCTGCGGCTCTGAAACCAT PstI

591 Forward AAAAAAGAATTC-CACTACATCGTTGCCAGATTGTGCGG EcoRI
Reverse AAAAAACTGCAG-CTAACeGAGCAGCCGGGTAACGTCGTT PstI

592a Forward AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGAAAATGGG EcoRI
Reverse AAAAAACTGCAG-TTACCAAACGTCGGATTTGATACG PstI

593 Forward CGCGGATCCGCTAGC-CTTGAACTGAACGGACTC BamHI-NheI
Reverse CCCGCTCGAG-GCGGAAGCGGACGATT XhoI

594a Forward AAAAAAGAATTC-GGTAAGTTCGCCGTTCAGGCCTTTCA EcoRI
Reverse AAAAAACTGCAG-TTACGCCGCCGTTTCCTGACACTCGCG PstI

595 Forward AAAAAAGAATTC-TGCCAGCCGCCGGAGGCGGAGAAAGC EcoRI
Reverse AAAAAACTGCAG-TTATTTCAAGCCGAGTATGCCGCG PstI

596 Forward CGCGGATCCCATATG-TCCCAACAATACGTC BamHI-NdeI
Reverse CCCGCTCGAG-ACGCGTTACCGGTTTGT XhoI

597 Forward CGCGGATCCCATATG-CTGCTTCATGTCAGC BamHI-NdeI
Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG HindIII

601 Forward CGCGGATCCCATATG-ATATGTTCCCAACCGGCAAT BamHI-NdeI
Reverse CCCGCTCGAG-AAAACAATCCTCAGGCAC XhoI

602 Forward CGCGGATCCGCTAGC-TTGCTCCATCAATGC BamHI-NheI
Reverse CCCGCTCGAG-ATGCAGCTGCTAAAAGCG XhoI

603 Forward AAAAAAGAATTC-CTGTCCTCGCGTAGGCGGGGACGGGG EcoRI
Reverse AAAAAACTGCAG-CTACAAGATGCCGGCAAGTTCGGC PstI

604 Forward CGCGGATCCGCTAGC-CCCGAAGCGCACTT BamHI-NheI
Reverse CCCGCTCGAG-GACGGCATCTGCACGG XhoI

606a Forward AAAAAAGAATTC-CGCGAATACCGCGCCGATGCGGGCGC EcoRI
Reverse AAAAAACTGCAG-TTAAAGCGATTTGAGGCGGGCGATACG PstI

607 Forward AAAAAAGAATTC-ATGCTGCTCGACCTCAACCGCTTTTC EcoRI
Reverse AAAAAACTGCAG-TCAGACGGCCTTATGCGATCTGAC PstI

608 Forward AAAAAAGAATTC-ATGTCCGCCCTCCTCCCCATCATCAACCG EcoRI
Reverse AAAAAACTGCAG-TTAGTCTATCCAAATGTCGCGTTC PstI

609 Forward CGCGGATCCCATATG-GTTGTGGATAGACTCG BamHI-NdeI
Reverse CCCGCTCGAG-CTGGATTATGATGTrrrtTr XhoI

610 Forward CGCGGATCCCATATG-ATTGG A GGOPTT a Tern a BamHI-NdeI
Reverse CCCGCTCGAG-ACGCTTCAACATCTTTGCC XhoI

611 Forward CGCGGATCCCATATG-CCGTCTCAAAACGGG BamHI-NdeI
Reverse CCCGCTCGAG-AACGACTTTGAACGCGCAA XhoI

613 Forward CGCGGATCCCATATG-TCGCGTTCfi A dCCCll BamHI-NdeI
Reverse CCCGCTCGAG-AGCCTGTAAAATAAGCGGC XhoI

614 Forward CGCGGATCCCATATG-TCCGTGGTGAGPnor BamHI-NdeI
Reverse CCCGCTCGAG-CCATACTGCGGCGTTC XhoI

616 Forward AAAAAAGAATTC-ATGTCAAACACAATCAAAATGGTTGTCGG EcoRI
Reverse AAAAAATCTAGA-TTAGTCCGGGCGGCAGGCAGCTCG XbaI

619a Forward AAAAAAGAATTC-GGGCTTCTCGCCGCCTCGCTTGC EcoRI
Reverse AAAAAACTGCAG-TCATTTTTTGTGTTTTAAAACGAGATA PstI

622 Forward CGCGGATCCCATATG-GCCGCCCTGCCTAAAG BamHI-NdeI
Reverse CCCGCTCGAG-TTTGTCCAAATGATAAATCTG XhoI

624 Forward CGCGGATCCCATATG-TCCCCGCGTTTTTArrr; BamHI-NdeI
Reverse CCCGCTCGAG-AGATTCGGGCCTGCGC XhoI

625 Forward CGCGGATCCCATATG-TTTGC A A CC. A an A a a ath BamHI-NdeI
Reverse CCCGCTCGAG-CGGCAAAATTACCGCCTT XhoI

627a Forward AAAAAAGAATTC-AAAGCAGGCGAGGCAGGCGCGCTGGG EcoRI
Reverse AAAAAACTGCAG-TTACGAATGAAACAGGGTACCCGTCATCAAGGC PstI

628 Forward AAAAAAGGTACC-GCCTTACAAACATGGATTTTGCGTTC KpnI
Reverse AAAAAACTGCAG-CTACGCACCTGAAGCGCTGGCAAA PstI

629a Forward AAAAAAGAATTC-GCCACCTTTATCGCGTATGAAAACGA EcoRI
Reverse AAAAAACTGCAG-TTACAACACCGCCGTCCGGTTCAAACC PstI

630a Forward AAAAAAGAATTC-GCGGCTTTGGGTATTTCTTTCGG EcoRI
Reverse AAAAAACTGCAG-TTAGGAGACTTCGCCAATGGAGCCGGG PstI

635 Forward AAAAAAGAATTC-ATGACCCAGCGACGGGTCGGCAAGCAAAACCG EcoRI
Reverse AAAAAACTGCAG-TTAATCCACTATAATCCTGTTGCT PstI

638 Forward AAAAAAGAATTC-ATGATTGGCGAAAAGTTTATCGTAGTTGG EcoRI
Reverse AAAAAACTGCAG-TCACGAACCGATTATGCTGATCGG PstI

639 Forward CGCGGATCCCATATG-ATGCTTT A TTTTOTTfr; BamHI-NdeI
Reverse CCCGCTCGAG-ATCGCGGCTGCCGAC XhoI

642 Forward CGCGGATCCCATATG-CGGT ATCrorrnr 1 a a t BamHI-NdeI
Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA XhoI
643 Forward CGCGGATCCCATATG-GCTTCGCCGTCGGCAG BamHI-NdeI
Reverse CCCGCTCGAG-AACCGAAAAACAGACCGC XhoI

644 Forward AAAAAAGAATTC-ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC EcoRI
Reverse AAAAAATCTAGA-CTACCCGCAATATCGGCAGTCCAATAT PstI

645 Forward AAAAAAGAATTC-GTGGAACAGAGCAACACGTTAAATCG EcoRI
Reverse AAAAAACTGCAG-CTACGAGGAAACCGAAGACCAGGCCGC PstI

647 Forward AAAAAAGAATTC-ATGCAAAGGCTCGCCGCAGACGG EcoRI
Reverse AAAAAACTGCAG-TTAGATTATCAGGGATATCCGGTAGAA PstI

648 Forward AAAAAAGAATTC-ATGAACAGGCGCGACGCGCGGATCGAACG EcoRI
Reverse AAAAAACTGCAG-TCAAGCTGTGTGCTGATTGAATGCGAC PstI

649 Forward AAAAAAGAATTC-GGTACGTCAGAACCCGCCCACCG EcoRI
Reverse AAAAAACTGCAG-TTAACGGCGGAAACTGCCGCCGTC PstI

650 Forward AAAAAAGAATTC-ATGTCCAAACTCAAAACCATCGC EcoRI
Reverse AAAAAACTGCAG-TCAGACGGCATGGCGGTCTGTTTT PstI

652 Forward AAAAAAGGTACC-GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG KpnI
Reverse AAAAAACTGCAG-TTATTTGCCCAGTTGGTAGAATGCGGC PstI

653 Forward AAAAAAGAATTC-GCGGCTTTGCCGGTAATTTTCATCGG EcoRI
Reverse AAAAAACTGCAG-CTATGCCGGTCTGGTTGCCGGCGGCGA PstI

656a Forward AAAAAAGAATTC-CGGCCGACGTCGTTGCGTCCTAAGTC EcoRI
Reverse AAAAAACTGCAG-CTACGATTTCGGCGATTTCCACATCGT PstI

657 Forward AAAAAAGAATTC-GCAGAATTTGCCGACCGCCATTTGTGCGC EcoRI
Reverse AAAAAACTGCAG-TTATAGGGACTGATGCAGTTTTTTTGC PstI

658 Forward CGCGGATCCCATATG-GTGTCCGGAATTGTG BamHI-NdeI
Reverse CCCGCTCGAG-GGCAGAATGTTTACCGTT XhoI

661 Forward AAAAAAGAATTC-ATGCACATCGGCGGCTATTTTATCGACAACCC EcoRI
Reverse AAAAAACTGCAG-TCACGACGTGTCTGTTCGCCGTCGGGC PstI

663 Forward CGCGGATCCCATATG-TGTATCGAGATGAAATT BamHI-NdeI
Reverse CCCGCTCGAG-GTAAAAATCGGGGCTGC XhoI

664 Forward CGCGGATCCCATATG-GCGGCTGGCGCGGT BamHI-NdeI
Reverse CCCGCTCGAG-AAATCGAGTTTTACACCAC XhoI

665 Forward AAAAAAGAATTC-ATGAAATGGGACGAAACGCGCTTCGG EcoRI
Reverse AAAAAACTGCAG-TCAATCCAAAATTTTGCCG A CO A TTTP PstI

666 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI
Reverse AAAAAATCTAGA-TCAGTTTAGGGATAGCAGGCGTAC XbaI

667 Forward AAAAAAGAATTC-CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG EcoRI
Reverse AAAAAACTGCAG-TTAATGACACAATAGGCGCAAGTC PstI
669 Forward AAAAAAGAATTC-ATGCGCCGCATCATTAAAAAACACCAGCC EcoRI
Reverse AAAAAACTGCAG-TTACAGTATCCGTTTGATGTCGGC PstI

670a Forward AAAAAAGAATTC-AAAAACGCTTCGGGCGTTTCGTCTTC EcoRI
Reverse AAAAAACTGCAG-TTAGGAGCTTTTGGAACGCGTCGGACTGGC PstI

671 Forward CGCGGATCCCATATG-ACC A GC. A GGGT A a r BamHI-NdeI
Reverse CCCGCTCGAG-AGCAACTATAAAAACGCAAG XhoI

672 Forward CGCGGATCCCATATG-AGGA AAA TCcar Arr BamHI-NdeI
Reverse CCCGCTCGAG-ACGGGATAGGCGGTTG XhoI

673 Forward AAAAAAGAATTC-ATGGATATTGAAACCTTCCTTGCAGG EcoRI
Reverse AAAAAACTGCAG-CTACAAACCCAGCTCGCGCAGGAA PstI

674 Forward AAAAAAGAATTC-ATGAAAACAGCCCGCCGCCGTTCCCG EcoRI
Reverse AAAAAACTGCAG-TCAACGGCGTTTGGGCTCGTCGGG PstI

675 Forward CGCGGATCCCATATG-AACACCATCGCrrr BamHI-NdeI
Reverse CCCGCTCGAG-TTCTTCGTCTTCAAACTGT XhoI

677a Forward AAAAAAGAATTC-AGACGGCATTCCCGATCAGTCGATTTTGA EcoRI
Reverse AAAAAACTGCAG-TTACGTATGCGCGAAATCGACCGCCGC PstI

680 Forward CGCGGATCCGCTAGC-ACGA A GGGH A dTTcnn BamHI-NheI
Reverse CCCGCTCGAG-CATCAAAAACCTGCCGC XhoI

681 Forward AAAAAAGAATTC-ATGACGACGCCGATGGCAATCAGTGC EcoRI
Reverse AAAAAACTGCAG-TTACCGTCTTCCGCAAAAAACAGC PstI

683 Forward CGCGGATCCCATATG-TGCAGC AC A CC.CiCl A r A a BamHI-NdeI
Reverse CCCGCTCGAG-GAGTTTTTTTCCGCATACG XhoI

684 Forward CGCGGATCCCATATG-TGCGGTACTGTGCAAAG BamHI-NdeI
Reverse CCCGCTCGAG-CTCGACCATCTGTTGCG XhoI

685 Forward CGCGGATCCCATATG-TGTTTGCTT A A T A ata a Ar at T BamHI-NdeI
Reverse CCCGCTCGAG-CTTTTTCCCCGCCGCA XhoI

686 Forward CGCGGATCCCATATG-TGCGGCGGTTCGGAAG BamHI-NdeI
Reverse CCCGCTCGAG-CATTCCGATTCTGATGAAG XhoI

687 Forward CGCGGATCCCATATG-TGCGACAGCAAAGTCCA BamHI-NdeI
Reverse CCCGCTCGAG-CTGCGCGGCTTTTTGTT XhoI

690 Forward CGCGGATCCCATATG-TGTTCTCCGAGCA A AGAr BamHI-NdeI
Reverse CCCGCTCGAG-TATTCGCCCCGTGTTTGG XhoI

691 Forward CGCGGATCCCATATG-GCCACGGCTT A T A Trrn BamHI-NdeI
Reverse CCCGCTCGAG-TTTGAGGCAGGAAGAAAG XhoI
694 Forward CGCGGATCCCATATG-TTGGTTTCCGCATCCGG BamHI-NdeI
Reverse CCCGCTCGAG-TCTGCGTCGGTGCGGT XhoI

695 Forward CGCGGATCCCATATG-TTGCCTCAAACTCGTCCG BamHI-NdeI
Reverse CCCGCTCGAG-TCGTTTGCGCACGGCT XhoI

696 Forward CGCGGATCCCATATG-TTGGGTTGCCGGC AGO BamHI-NdeI
Reverse CCCGCTCGAG-TTGATTGCCGCAATGATG XhoI

700a Forward AAAAAAGAATTC-GCATCGACAGACGGTGTGTCGTGGAC EcoRI
Reverse AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC PstI

701 Forward CGCGGATCCCATATG-AAGACTTGTTTGGATACTTC BamHI-NdeI
Reverse CCCGCTCGAG-TGCCGACAACAGCCTC XhoI

702 Forward AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC EcoRI
Reverse AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA PstI

703 Forward CGCGGATCCGCTAGC-C AAACGCTGGC A AC.C.n BamHI-NheI
Reverse CCCGCTCGAG-TTTTGCAGGTTTGATGTTTG XhoI

704a Forward AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG EcoRI
Reverse AAAAAACTGCAG-TTAGTTTTGCCGGATAATATGGCGGGTGCG PstI

707 Forward CGCGGATCCGCTAGC-GAAATTATTAACGATGCAOA BamHI-NheI
Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGA XhoI

708 Forward CGCGGATCCGCTAGC-CCTTTTAAGCCATCCAAAA BamHI-NheI
Reverse CCCGCTCGAG-TTGACCGGTGAGGACG XhoI

710 Forward CGCGGATCCCATATG-GAAACCCACGAA A A A TC. BamHI-NdeI
Reverse CCCGCTCGAG-AACGGTTTCGGTCAG XhoI

714 Forward CGCGGATCCCATATG-AGCTATCAAGACATCTT BamHI-NdeI
Reverse CCCGCTCGAG-GCGGTAGGTAAATCGGAT XhoI

716 Forward CGCGGATCCCATATG-GCCAACAAACCGGCAAG BamHI-NdeI
Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC XhoI

718 Forward CGCGGATCCCATATG-G AGCCG AT AATGGH A A A BamHI-NdeI
Reverse CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC XhoI

720 Forward CGCGGATCCCATATG-AGCGGATGGCATACC BamHI-NdeI
Reverse CCCGCTCGAG-TTTTGCATAGCTGTTGACCA XhoI

723 Forward CGCGGATCCCATATG-CGACCCAAGCCCC BamHI-NdeI
Reverse CCCGCTCGAG-AATGCGAATCCGCCGCC XhoI
725 Forward CGCGGATCCCATATG-GTGCGCAGGOTTA A a BamHI-NdeI
Reverse CCCGCTCGAG-TTGCTTATCCTTAAGGGTTA XhoI

726 Forward CGCGGATCCCATATG-ACCATrTATTTr A a AAAr BamHI-NdeI
Reverse CCCGCTCGAG-GCCGATGTTTAGCGTCC XhoI

728 Forward CGCGGATCCCATATG-TTTTGGCTGGGAACGGG BamHI-NdeI
Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC XhoI

729 Forward CGCGGATCCCATATG-TGCACCATGATTCCCCA BamHI-NdeI
Reverse GCCCAAGCTT-TTTGTCGGTTTGGGTATC HindIII

731 Forward CGCGGATCCGCTAGC-GCCGTGCCGGAGG BamHI-NheI
Reverse CCCGCTCGAG-ACGGGCGCGGCAG XhoI

732 Forward CCGGAATTCTACATATG-TCGAAACCTGTTTTTAAGAA EcoRI-NdeI
Reverse CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTC XhoI

733 Forward CGCGGATCCCATATG-GCCTGCGGCGGCAA BamHI-NdeI
Reverse CCCGCTCGAG-TCGCTTGCCTCCTTTAC XhoI

734 Forward CGCGGATCCCATATG-GCCGATACTTACGGCTAT BamHI-NdeI
Reverse CCCGCTCGAG-TTTGAGATTTTGAATCAAAGAG XhoI

735 Forward CGCGGATCCCATATG-AAGCAGCAGGCGGTCA BamHI-NdeI
Reverse CCCGCTCGAG-ATTTCCGTAGCCGAGGG XhoI

737 Forward CGCGGATCCCATATG-CACCACGACGGACACG BamHI-NdeI
Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA XhoI

739 Forward CGCGGATCCCATATG-GCAAAAAAACCGAACA BamHI-NdeI
Reverse CCCGCTCGAG-GAAGAGTTTGTCGAGAATT XhoI

740 Forward CGCGGATCCCATATG-GCCAATCCGCCCGAAG BamHI-NdeI
Reverse CCCGCTCGAG-AAACGCGCCAAAATAGTG XhoI

741 Forward CGCGGATCCCATATG-TGCAGCAGCGGAGGG BamHI-NdeI
Reverse CCCGCTCGAG-TTGCTTGGCGGCAAGGC XhoI

743 Forward CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT BamHI-NdeI
Reverse CCCGCTCGAG-CTTACGGATCAAATTGACG XhoI

745 Forward CGCGGATCCCATATG-TTTTGGCAACTGACCG BamHI-NdeI
Reverse CCCGCTCGAG-CAAATCAGATGCCTTTAGG XhoI

746 Forward CGCGGATCCCATATG-TCCGAAAACAAACAAAAC BamHI-
NdeI
Reverse CCCGCTCGAG-TTCATTCGTTACCTGACC XhoI

747 Forward CCGGAATTCTAGCTAGC-CTGACCCCTTGGG EcoRI-NheI
Reverse GCCCAAGCTT-TTTTGATTTTAATTGACTATAGAAC HindIII

749 Forward CGCGGATCCCATATG-TGCCAGCCGCCG BamHI-NdeI
Reverse CCCGCTCGAG-TTTCAAGCCGAGTATGC XhoI

750 Forward CGCGGATCCCATATG-TGTTCGCCCGAACCTG BamHI-NdeI
Reverse CCCGCTCGAG-CTTTTTCCCCGCCGCAA XhoI

758 Forward CGCGGATCCCATATG-AACAATCTGACCGTGTT BamHI-NdeI
Reverse CCCGCTCGAG-TGGCTCAATCCTTTCTGC XhoI

759 Forward CGCGGATCCGCTAGC-CGCTTCACACACACCAC BamHI-NheI
Reverse CCCGCTCGAG-CCAGTTGTAGCCTATTTTG XhoI

763 Forward CGCGGATCCCATATG-CTGCCTGAAGCATGGCG BamHI-NdeI
Reverse CCCGCTCGAG-TTCCGCAAATACCGTTTCC XhoI

764 Forward CGCGGATCCCATATG-TTTTTCTCCGCCCTGA BamHI-NdeI
Reverse CCCGCTCGAG-TCGCTCCCTAAAGCTTTC XhoI

765 Forward CGCGGATCCCATATG-TTAAGATGCCGTCCG BamHI-NdeI
Reverse CCCGCTCGAG-ACGCCGACGTTTTTTATTAA XhoI

767 Forward CGCGGATCCCATATG-CTGACGGAAGGGGAAG BamHI-NdeI
Reverse CCCGCTCGAG-TTTCTGTACAGCAGGGG XhoI

768 Forward CGCGGATCCCATATG-GCCCCGCAAAAACCCG BamHI-NdeI
Reverse CCCGCTCGAG-TTTCATCCCTTTTTTGAGC XhoI

770 Forward CGCGGATCCCATATG-TGCGGCAGCGGCGAA BamHI-NdeI
Reverse CCCGCTCGAG-GCGTTTGTCGAGATTTTC XhoI

771 Forward CGCGGATCCCATATG-TCCGTATATCGCACCTTC BamHI-NdeI
Reverse CCCGCTCGAG-CGGTTCTTTAGGTTTGAG XhoI

772 Forward CGCGGATCCCATATG-TTTGCGGCGTTGGTGG BamHI-NdeI
Reverse CCCGCTCGAG-CAATGCCGACATCAAACG XhoI

774 Forward CGCGGATCCCATATG-TCCGTTTCACCCGTTGC BamHI-NdeI
Reverse CCCGCTCGAG-TCGTTTGCGCACGGCT XhoI

790 Forward CGCGGATCCCATATG-GCAAGAAGGTCAAA AAf BamHI-NdeI
Reverse CCCGCTCGAG-GGCGTTGTTCIGO A TTTPr; XhoI

900 Forward CGCGGATCCCATATG-CCGTCTO A A A Tnccn BamHI-NdeI
Reverse CCCGCTCGAG-ATATGGAAAAGTCTGTTGTC XhoI

901 Forward CGCGGATCCCATATG-Cnco A Trrrrrn a yr T BamHI-NdeI
Reverse CCCGCTCGAG-AAAATGGAACAATACCAGG XhoI

902 Forward CCGGAATTCTACATATG-TTGCACTTTCAAAGGATAATC EcoRI-NdeI
Reverse CCCGCTCGAG-AAAAATGTACAATGGCGTAC XhoI

903 Forward CCGGAATTCTAGCTAGC-CAGCGTCAGCAGCACAT EcoRI-NheI
Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGAA XhoI

904 Forward AAAAAAGGTACC-ATGATGCAGCACAATCGTTTC KpnI
Reverse AAACTGCAG-TTAATATCGATAGGTTATATG PstI

904a Forward AAAAAAGAATTC-CGGCTCGGCATTGTGCAGATGTTGCA EcoRI
Reverse AAACTGCAG-TTAATATCGATAGGTTATATG PstI

905 Forward CGCGGATCCCATATG-A A C. A A AATATArrr.r Ar BamHI-NdeI
Reverse CCCGCTCGAG-CCACTGATAACCGACAGAT XhoI

907 Forward CGCGGATCCCATATG-GGCGCGCAACGTGAG BamHI-NdeI
Reverse CCCGCTCGAG-ACGCCACTGCCAGCG XhoI

908 Forward AAAGAATTC-GCAGAGTTAGTAGGCGTTAATAAAAATAC EcoRI
Reverse AAACTGCAG-TTAATATGGTTTTGTCGTTCG PstI

909 Forward CGCGGATCCCATATG-TGCGCGTGGGAAACTTAT BamHI-NdeI
Reverse CCCGCTCGAG-TCGGTTTTGAAACTTTGGTTTT XhoI

910 Forward AAAGAATTC-GCATTTGCCGGCGACTCTGCCGAGCG EcoRI
Reverse AAACTGCAG-TCAGCGATCGAGCTGCTCTTT PstI

911 Forward AAAGAATTC-GCTTTCCGCGTGGCCGGCGGTGC EcoRI
Reverse AAAAAACTGCAG-GTCGACTTATTCGGCGGCTTTTTCCGC PstI

912 Forward AAAAAAGAATTC-CAAATCCGTCAAAACGCCACTCAAGTATTGAG EcoRI
Reverse AAAAAACTGCAG-TTACAGTCCGTCCACGCCTTTCGC PstI

913 Forward CGCGGATCCCATATG-GAAACCCGCCCCGC BamHI-NdeI
Reverse CCCGCTCGAG-AGGTTGTGTTCCAGGTTG XhoI

915 Forward CGCGGATCCCATATG-TGCCGGCAGGCGGAA BamHI-NdeI
Reverse CCCGCTCGAG-TTTGAAAATATAGGTATCAGG XhoI

914 Forward AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG EcoRI
Reverse AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG PstI

916 Forward CGCGGATCCCATATG-GCAATGATGGCGGCTG BamHI-NdeI
Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT XhoI
917 Forward AAAAAAGAATTC-CCTGCCGAAAAACCGGCACCGGC EcoRI
Reverse AAAAAACTGCAG-TTATTTCCCCGCCTTCACATCCTG PstI

919 Forward CGCGGATCCCATATG-TGCC AAAGC A AG AfifATr BamHI-NdeI
Reverse CCCGCTCGAG-CGGGCGGTATTCGGG XhoI

920 Forward CGCGGATCCCATATG-CACCGCGTCTGGGTC BamHI-NdeI
Reverse CCCGCTCGAG-ATGGTGCGAATGACCGA XhoI

921 Forward AAAAAAGAATTC-TTGACGGAAATCCCCGTGAATCC EcoRI
Reverse AAAAAACTGCAG-TCATTTCAAGGGCTGCATCTTCAT PstI

922 Forward CGCGGATCCGCTAGC-TGTACGGCGATGGAGGP BamHI-NheI
Reverse CCCGCTCGAG-CAATCCCGGGCCGCC XhoI

923 Forward CGCGGATCCCATATG-TGTTACGCA AT A TTGTPPr BamHI-NheI
Reverse CCCGCTCGAG-GGACAAGGCGACGAAG XhoI

925 Forward CGCGGATCCCATATG-AAACAAATGCTTTT A ClCm BamHI-NdeI
Reverse CCCGCTCGAG-GCCGTTGCATTTGATTTC XhoI

926 Forward CGCGGATCCCATATG-TGCGCGC AATT A rrTP BamHI-NdeI
Reverse CCCGCTCGAG-TCTCGTGCGCGCCG XhoI

927 Forward CGCGGATCCCATATG-TGCAGCCCCGCAGP BamHI-NdeI
Reverse CCCGCTCGAG-GTTTTTTGCTGACGTAGT XhoI

929a Forward AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG EcoRI
Reverse AAAAAATCTAGA-TTAAGAAAGACGGAAACTACTGCC XbaI

931 Forward AAAAAAGAATTC-GCAACCCATGTTTTGATGGAAAC EcoRI
Reverse AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG PstI

935 Forward AAAAAAGAATTC-GCGGATGCGCCCGCGATTTTGGATGACAAGGC EcoRI
Reverse AAAAAACTGCAG-TCAAAACCGCCAATCCGCCGACAC PstI

936 Forward CGCGGATCCCATATG-GCCGCCGTCGGCGr BamHI-NdeI
Reverse CCCGCTCGAG-GCGTTGGACGTAGTTTTG XhoI

937 Forward AAAAAAGAATTC-CCGGTTTACATTCAAACCGGCGCAAC EcoRI
Reverse AAAAAACTGCAG-TTAAAATGTATGCTGTACGCCAAA PstI

939a Forward AAAAAAGAATTC-GGTTCGGCAGCTGTGATGAAACC EcoRI
Reverse AAAAAACTGCAG-TTAACGCAAACCTTGGATAAAGTTGGC PstI

950 Forward CGCGGATCCCATATG-GCCAACAAACCGGCAAG BamHI-NdeI
Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC XhoI

953 Forward CGCGGATCCCATATG-GCCACCTACAAAGTGGAC BamHI-NdeI
Reverse CCCGCTCGAG-TTGTTTGGCTGCCTCGAT XhoI

957 Forward CGCGGATCCCATATG-TTTTGGCTGGG A A maa BamHI-
NdeI
Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC XhoI

958 Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG BamHI-NdeI
Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC HindIII

959 Forward CGCGGATCCCATATG-CACCACGACGGACACG BamHI-NdeI
Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA XhoI

961 Forward CGCGGATCCCATATG-GCCACAAGCGACGACG BamHI-NdeI
Reverse CCCGCTCGAG-CCACTCGTAATTGACGC XhoI

972 Forward AAAAAAGAATTC-TTGACTAACAGGGGGGGAGCGAAATTAAAAAC EcoRI
Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG XbaI

973 Forward AAAAAAGAATTC-ATGGACGGCGCACAACCGAAAAC EcoRI
Reverse AAAAAACTGCAG-TTACTTCACGCGGGTCGCCATCAGCGT PstI

982 Forward CGCGGATCCCATATG-GCAGCAAAAGACGTAC BamHI-NdeI
Reverse CCCGCTCGAG-CATCATGCCGCCCATCC XhoI

983 Forward CGCGGATCCCATATG-TTAGCTGTTGCAACAACAC BamHI-NdeI
Reverse CCCGCTCGAG-GAACCGGTAGCCTACG XhoI

987 Forward CGCGGATCCCATATG-CCCCCACTGGAAGAAC BamHI-NdeI
Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC XhoI

988 Forward CGCGGATCCCATATG-TCTTTAAATTTACGGGAAAAAG BamHI-NdeI
Reverse GCCCAAGCTT-TGATTTGCCTTTCCGTTTT HindIII

989 Forward CCGGAATTCTACATATG-GTCCACGCATCCGGCTA EcoRI-NdeI
Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC XhoI

990 Forward CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT BamHI-NheI
Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA XhoI

992 Forward CGCGGATCCCATATG-GACGCGCCCGCCCG BamHI-NdeI
Reverse CCCGCTCGAG-CCAAATGCCCAACCATTC XhoI

993 Forward CGCGGATCCCATATG-GCAATGCTGATTGAAATCA BamHI-NdeI
Reverse CCCGCTCGAG-GAACACATCGCGCCCG XhoI

996 Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCGC BamHI-NdeI
Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTG XhoI

997 Forward CCGGAATTCTAGCTAGC-CGGCACGCCGACGTT EcoRI-NheI
Reverse CCCGCTCGAG-GACGGCATrGPTr A nn XhoI
Underlined sequences indicate restriction recognition sites.
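
As an illustration of the primer notation, the following minimal Python sketch locates the recognition sites in the 5' tail of a primer (the part before the hyphen). The site table holds the standard recognition sequences of the enzymes named in the table; the function name `find_sites` and the data layout are assumptions made for this example and are not part of the original disclosure.

```python
# Standard recognition sequences for the enzymes named in the primer table.
RECOGNITION_SITES = {
    "BamHI": "GGATCC",
    "NdeI": "CATATG",
    "NheI": "GCTAGC",
    "XhoI": "CTCGAG",
    "EcoRI": "GAATTC",
    "PstI": "CTGCAG",
    "XbaI": "TCTAGA",
    "KpnI": "GGTACC",
    "HindIII": "AAGCTT",
    "SalI": "GTCGAC",
}


def find_sites(primer):
    """Return (enzyme, offset) pairs found in the 5' tail of a primer.

    The tail is everything before the first hyphen; offsets are 0-based.
    """
    tail = primer.split("-", 1)[0].upper()
    hits = []
    for enzyme, site in RECOGNITION_SITES.items():
        offset = tail.find(site)
        if offset != -1:
            hits.append((enzyme, offset))
    # Report sites in 5' to 3' order.
    return sorted(hits, key=lambda hit: hit[1])


# Forward and reverse primers for ORF 643, as listed in the table.
print(find_sites("CGCGGATCCCATATG-GCTTCGCCGTCGGCAG"))
print(find_sites("CCCGCTCGAG-AACCGAAAAACAGACCGC"))
```

For the ORF 643 pair this reports BamHI and NdeI in the forward tail and XhoI in the reverse tail, matching the enzymes given in the table.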

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1>:

g001.seq



1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG



This corresponds to the amino acid sequence <SEQ ID 2; ORF 001.ng>:

g001.pep

1 MLPQGKAARR VSANEVSGRA CARMVLVICQ TLPKRDTLNG SGTHTLPVWA

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 
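
The correspondence between a DNA listing and its amino acid sequence can be checked by translating the open reading frame with the standard genetic code. The sketch below is a minimal illustration, assuming the standard code and frame 1; the names `CODON_TABLE` and `translate` are chosen for this example. Codons containing ambiguity symbols (such as the `s`, `y`, `k` or `.` characters that appear in some listings) are reported as `X`, mirroring the `X` residues in the peptide listings.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1; whitespace and case are ignored.

    Any codon not in the standard table (for example one containing an
    ambiguity symbol) is translated as 'X'. A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 nucleotides of g001.seq, as listed above.
print(translate("ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG"))  # MLPQGKAARR
```

The output agrees with the first ten residues of g001.pep.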

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3>: 



1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss . GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG



This corresponds to the amino acid sequence <SEQ ID 4; ORF 001>:

m001.pep

1 MLPQGKAARR MSANEVCGXL XAWXVLVICQ TLPKRDTLNG SGTHTVPVWA

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5>: 



1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 



This corresponds to the amino acid sequence <SEQ ID 6; ORF 001.a>:

a001.pep

1 MLPQGKAARR MSANEVCGKA WAWMVLVICQ TLPKRDTLNG SGTHTVPVWA

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

m001/a001 96.2% identity over a 131 aa overlap



[Alignment of m001.pep with a001.pep; only one sequence row, residues 61-120, survives:]

        70        80        90       100       110       120
TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC
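
The identity figure quoted for m001/a001 can be reproduced with a simple position-by-position comparison. The sketch below is illustrative only: it assumes an ungapped alignment of equal-length sequences, counts an `X` residue as a mismatch, and uses the peptide sequences as transcribed from the m001.pep and a001.pep listings (without the terminal stop). The function name `percent_identity` is chosen for this example; the original comparison was produced by alignment software that is not specified here.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of identical positions between two equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Peptides written in blocks of ten residues, as in the listings.
m001 = "".join([
    "MLPQGKAARR", "MSANEVCGXL", "XAWXVLVICQ", "TLPKRDTLNG", "SGTHTVPVWA",
    "ILPRSLRSKS", "TIITFSARFF", "GSACNSAARR", "SSCPSPKIGA", "VPFIGSVLMV",
    "PSEPILRKSS", "GEKHSVHADC", "PSASGRWDKT", "A",
])
a001 = "".join([
    "MLPQGKAARR", "MSANEVCGKA", "WAWMVLVICQ", "TLPKRDTLNG", "SGTHTVPVWA",
    "ILPRSLRSKS", "TIITFSARFF", "GSACNSAARR", "SSCPSPKIGA", "VPFIGSVLMV",
    "PSEPILRKSS", "GEKHSVHADC", "PCASGRWDKT", "A",
])

print(f"{percent_identity(m001, a001):.1f}% identity over {len(m001)} aa")
# 96.2% identity over 131 aa
```

Five of the 131 positions differ (126 matches), which gives the 96.2% stated above.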
Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF 001.ng)
from N. gonorrhoeae:

m001/g001

                 10        20        30        40        50        60
m001.pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS
         |||||||||| ||||| |   |  |||||||||||||||||||||:||||||||||||||
g001     MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 7>:

g003.seq

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC

451 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt

651 tcatgattGC TGa




This corresponds to the amino acid sequence <SEQ ID 8; ORF 003.ng>: 

g003.pep 

1 MVVFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG

51 FARQRFVGFA DVDVAVAVGV FNQVVLMVFL GVVEVFQRFV FNNEGQLVFL

101 LLAFEGGGDD GFFGGVGVVH AAAVLRAGVV TLFVEAGRIN DAEIILQDVV

151 QAEFVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG

201 FQAPKAAAGE VNGARVHDC



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 9>: 

m003.seq

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC

451 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg.CCGGC GAGGTAAATG GTGCGCGTGT

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

m003.pep Length: 221

1 MVVFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG

51 FARQRFVSXA DVDVAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL

101 LLAFEGXGDD GFFXGVGVVH AAAVLRTGVV ALFVEAGRIN DAEEILQDVV

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI

201 GFQAPEAAXG EVNGARVHDF *



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 11>:

a003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC

451 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 12; ORF 003.a>: 

a003.pep

1 MVVFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG

51 FARQRFVGFA DIDVAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL

101 LLAFEGGGDD GFFGGVGVVH AAAVLRTGVV ALFVEAGRIN DAEEILQDVV

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI

201 GFQAPEAAAG EVDGARVHDF *



m003/a003 95.9% identity over a 220 aa overlap 

                 10        20        30        40        50        60
m003.pep MVVFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA
         |||||||||||||||||| ||||||||||||||||||||||||||| ||||||||||: |
a003     MVVFVAEGIFGRAVLGNLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGLGFARQRFVGFA
                 10        20        30        40        50        60
Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003.ng)
from N. gonorrhoeae:

m003/g003 
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The following partial DNA sequence was identified in N, gonorrhoeae <SEO ID 13>* 

g004.seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

401 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATG 

451 [illegible]AT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 

551 ACATATATTC GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtctACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAGCATCCAG 

651 ATGCAGTTTC AATtctTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

701 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

751 TTCACACCCA AACGGTGCAA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 14; ORF 004.ng>: 

g004.pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSWLDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FTPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m004.seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

401 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 

451 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 

651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004.pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSWLDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 17>: 

a004.seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 

301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

401 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

451 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CATCCAAATG 

651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 18; ORF 004.a>: 

a004.pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

51 AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

101 GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWLDASPAI KICGILVCIV 

151 SGSATGTPRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 

251 TPKRCNA* 

m004/a004 94.9% identity over a 257 aa overlap 

a004        LPWSAASCSRGSWLDASPAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 
                   130       140       150       160       170       180 

                   250 
m004.pep    IPPKPKISTFTPKRCNAX 
            :||||||||||||||||| 
a004        MPPKPKISTFTPKRCNAX 
                   250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004.ng) 
from N. gonorrhoeae: 

m004/g004 

120 130 140 150 160 170 179 

m004.pep ILPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 19>: 

g005.seq 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG AGGAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

251 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

401 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

451 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GGATAAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

701 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

751 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 

801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

851 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 

901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 

951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005.ng>: 

g005.pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYGIF LLELLTVFGA IALIVLAIVQ 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGVVHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVADKI VSAPFAVIGS VGVVAEVPNI HRLLKKHDID VDVMTAGEFK 

251 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005.seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 [illegible] A.CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 [illegible] CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 

601 [illegible] TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGTA 

1101 G 




This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYGIFLL ELLTVFGAIA LIVLAIVQSK 

51 KQSXSGSVVL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVADKIAS APFAIVGSVG VVAEVPNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005.seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CAAAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AAAATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGCACACG 

401 CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

451 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGTA 

1101 G 




This corresponds to the amino acid sequence <SEQ ID 24; ORF 005.a>: 

a005.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYGIFLL ELLTVFGAIA LIVLAIVQSK 

51 KQSESGSVVL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE SPGGVVHGYG LAASQLRRLR ERNIPLTVAV DKVAASGGYM 

201 MACVADKIVS APFAIVGSVG VVAEVPNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 



m005/a005 79.2% identity over a 366 aa overlap 

a005        TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 

m005.pep    RRADVMX 
            ||||||| 
a005        RRADVMX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005.ng) 
from N. gonorrhoeae: 

m005/g005 

           120       130       140       150       160       170 
m005.pep    SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
g005        SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGVVHGYGLAASQLRR 
                   130       140       150       160       170       180 

m005.pep    XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDID 
g005        LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGVVAEVPNIHRLLKKHDID 
                   190       200       210       220       230       240 

m005.pep    FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKL 
            ||||||||||||||||||||||||||||||||||||||:||||||||||||||||||||| 
g005        FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 
                   310       320       330       340       350       360 

           360 
m005.pep    VNRRADVMX 
g005        VNRRADVMX 




The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 25>: 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

451 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 26; ORF 006.ng>: 

g006.pep 

1 MLLVLEFWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDERQLYRH YGLVSRLRVL ISNREAFGYL CVGAAMGILF GFAFVMMTLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 
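
The correspondence between a DNA listing and its predicted ORF can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base of the listing; the function name `translate` is illustrative and is not part of the original disclosure. Codons containing an ambiguous base (for example `N` or `.`) are reported as `X`, which mirrors the convention visible in the listings.

```python
import re
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(listing: str) -> str:
    """Translate a DNA listing; row numbers, spaces and mixed case are allowed."""
    # Keep only letters and '.', which drops position labels and whitespace.
    dna = re.sub(r"[^A-Za-z.]", "", listing).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are absent from the table and become 'X'.
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 30 and last 12 bases of the g006 listing.
print(translate("1 ATGCTGCTGG TGCTggaatt ttggttCGGc"))  # MLLVLEFWFG
print(translate("451 GCCGGAACTT GA"))                    # AGT*
```

The two fragments reproduce the start (`MLLVLEFWFG`) and the end (`AGT*`) of the g006 peptide listing.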



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 27>: 

m006.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGGCAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

451 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

m006.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 
51 KGDRRQLYRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVMMTLK 
101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29>: 

a006.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 

451 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 
51 KGDERQLDRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVMMTLK 
101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 
151 AGT* 

m006/a006 96.7% identity over a 153 aa overlap 

                    10        20        30        40        50        60 
m006.pep    MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 
            ||||||||||||||||||||||||||||||||||||||:||||||||||||||:||| || 
a006        MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERDNHFIRKGDERQLDRH 
                    10        20        30        40        50        60 

                    70        80        90       100       110       120 
m006.pep    YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||: 
a006        YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAI 
                    70        80        90       100       110       120 

                   130       140       150 
m006.pep    SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
            |||||||||||||||||||||||||:|||||||| 
a006        SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
                   130       140       150 
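
The identity figure quoted for an alignment can be recomputed from the two aligned sequences. The sketch below assumes a gap-free alignment of equal-length sequences, as is the case for m006 and a006; the helper name `percent_identity` is illustrative, and the sequences are copied from the peptide listings with the terminal stop omitted.

```python
def percent_identity(seq_a: str, seq_b: str):
    """Return (matches, overlap, percent) for two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Count positions carrying the same residue in both sequences.
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    overlap = len(seq_a)
    return matches, overlap, 100.0 * matches / overlap


M006 = (
    "MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIR"
    "KGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLK"
    "GYSSAGHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIK"
    "AGT"
)
A006 = (
    "MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERDNHFIR"
    "KGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLK"
    "GYSSAGHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIK"
    "AGT"
)

matches, overlap, pct = percent_identity(M006, A006)
print(f"{pct:.1f}% identity over a {overlap} aa overlap")
```

Five positions differ between the two sequences, so 148 of 153 residues match, which agrees with the 96.7% figure stated for m006/a006.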



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from N. gonorrhoeae: 

m006/g006 

                    10        20        30        40        50        60 
m006.pep    MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 
            |||||||| ||||||||||||||||||||||||||||||||||||||||||||:|||||| 
g006        MLLVLEFWFGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDERQLYRH 
                    10        20        30        40        50        60 

                    70        80        90       100       110       120 
m006.pep    YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 
            |||::||||||||||||||||||:||||||||||||||||||:||||:|||||||||||| 
g006        YGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSAGHIYSVGTYLWMFAM 
                    70        80        90       100       110       120 

                   130       140       150 
m006.pep    SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
            ||||||||||||||||||||||||||||||||| 
g006        SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGT 
                   130       140       150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31>: 

g006-1.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

451 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

751 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 
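
Listing rows can be rejoined into a continuous sequence before any further analysis. The sketch below is one way to do this, assuming 50 bases per full row as in the listings here; `join_listing_rows` is a hypothetical helper, and the example rows deliberately keep the stray spaces that text extraction can introduce into the position labels.

```python
import re


def join_listing_rows(block: str, row_len: int = 50) -> str:
    """Join numbered listing rows into one sequence, checking the row lengths."""
    rows = []
    for line in block.splitlines():
        # Drop position labels and spaces; keep bases and '.' placeholders.
        letters = re.sub(r"[^A-Za-z.]", "", line).upper()
        if letters:
            rows.append(letters)
    # Every row except the last is expected to be full.
    for number, row in enumerate(rows[:-1], start=1):
        if len(row) != row_len:
            raise ValueError(f"row {number} has {len(row)} bases, expected {row_len}")
    return "".join(rows)


tail = """
8 01 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA
8 51 TCAAAGCCGG AACTTGA
"""
seq = join_listing_rows(tail)
print(len(seq), seq[-3:])  # 67 TGA
```

With 800 bases preceding these two rows, the listing totals 867 bases, that is 289 codons: 288 residues plus the stop codon, consistent with the 288 aa overlap reported for this ORF.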

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-1.ng>: 

g006-1.pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

51 QALLYALVVF LMWLVGAARR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIAATSVVSI FGACIMLLVL EFWVGVSAVG 

151 ILALFLWLLP RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGYLCVGAA MGILFGFAFV MMTLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33>: 

m006-1.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

451 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCACGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 

751 TGGATGTTTG CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 34; ORF 006-1>: 

m006-1.pep 

1 MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 

51 QALLYALVVL LMWLVGAVRR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIAATSVVSI FGACIMLLVL EFWVGVSAVG 

151 ILALFLWLLP RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 

201 RLRVLISNRE AFGYLCVGTA MGILFGFAFV MMTLKGYSSA GHVYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

m006-1/g006-1 95.5% identity in 288 aa overlap 



                    10        20        30        40        50        60 
m006-1.pep  MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALVVL 
            |||||||||:||||||||||| ||||||||| |||||| ||||||||:|||||||||||: 
g006-1      MWKMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALVVF 
                    10        20        30        40        50        60 

                    70        80        90       100       110       120 
m006-1.pep  LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
            |||||||:|||||||||||||||||||||||||||||||||||||||||||||||||||| 
g006-1      LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
                    70        80        90       100       110       120 

                   130       140       150       160       170       180 
m006-1.pep  PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
g006-1      PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
                   130       140       150       160       170       180 

                   190       200       210       220       230       240 
m006-1.pep  NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
            ||||||||:|||||||||::||||||||||||||||||:||||||||||||||||||:|| 
g006-1      NHFIRKGDERQLYRHYGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 
                   190       200       210       220       230       240 

                   250       260       270       280 
m006-1.pep  GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
            ||:|||||||||||||||||||||||||||||||||||||||||||||| 
g006-1      GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
                   250       260       270       280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-1.seq (partial) 

1 ..AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

351 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

401 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 

451 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 

501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 

551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 

601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 

651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 



701 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 

751 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 

801 ACGGATAGAG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006-1.a>: 

a006-1.pep (partial) 

1 ..SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AWQALLYALV 

51 VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

101 SREFVSFFEE HLPIAATSVV SIFGACIMLL VLEFWVGVSA VGILALFLWL 

151 LPRFAAISEN LYFRLKNSLE RDNHFIRKGD ERQLDRHYGL LARLRVLISN 

201 REAFGYLCVG TAMGILFGFA FVMMTLKGYS SAGHVYSVGT YLWMFAISLD 

251 DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 

a006-1/m006-1 95.7% identity in 280 aa overlap 

                            10        20        30        40        50 
a006-1.pep          SQNHRKRLIGTFFLVGLENLLMLVYPVFGGWAINAVIAGQAWQALLYALVVL 
                    :|:||||||||| ||||||||||||||||| ||||||||::||||||||||| 
m006-1      MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALVVL 
                    10        20        30        40        50        60 

                  60        70        80        90       100       110 
a006-1.pep  LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
            |||||||:|||||||||||||||||||||||||||||||||||||||||||||||||||| 
m006-1      LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
                    70        80        90       100       110       120 

                 120       130       140       150       160       170 
a006-1.pep  PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERD 
            |||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||| 
m006-1      PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
                   130       140       150       160       170       180 

                 180       190       200       210       220       230 
a006-1.pep  NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
            ||||||||:||| ||||||||||||||||||||||||||||||||||||||||||||||| 
m006-1      NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
                   190       200       210       220       230       240 

                 240       250       260       270       280 
a006-1.pep  GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
            ||||||||||||||:|||||||||||||||||||||||||:|||||||| 
m006-1      GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
                   250       260       270       280 


The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>: 

g007.seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 




This corresponds to the amino acid sequence <SEQ ID 38; ORF 007.ng>: 

g007.pep 

1 MNTTRLPTAF ILCCLCAAAS AADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 



m007.seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC. 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCAACCGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007.pep 

1 MNTTRLPTAL VLGCFCAAAS AADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRHLYH ERL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007.seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC. 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 42; ORF 007.a>: 

a007.pep 

1 MNTTRLPTAL VLGCLCAAAS AADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 



m007/a007 97.3% identity over a 113 aa overlap 

                    10        20        30        40        50        60 
m007.pep    MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
            ||||||||||||||:||||||||||||||||||||||||||||||||||||||||||||| 
a007        MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
                    10        20        30        40        50        60 

                    70        80        90       100       110 
m007.pep    FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
            ||||||||||||||||||||||||||||||||||: |||||||||||||||||| 
a007        FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 
                    70        80        90       100       110 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 007 shows 86.7% identity over a 113 aa overlap with a predicted ORF (ORF 007.ng) 
from N. gonorrhoeae: 

m007/g007 

                    10        20        30        40        50        60 
m007.pep    MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
            |||||||||::| |:|||||||||||||||||||||||:|||||||||||| ||||:||| 
g007        MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 
                    10        20        30        40        50        60 

                    70        80        90       100       110 
m007.pep    FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
             ||:||:||||||||||:||:|| ||||:||:||||||||||||||||||||| 
g007        CIMNKPHVLLHSMVKGIDGTFKVERQNLRRIYARNRHQRCGHCRRRHLYHERL 
                    70        80        90       100       110 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 43>: 

g007-1.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC TATATTATGA ACAAACCGCA 

201 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC . 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-1.ng>: 

g007-1.pep (partial) 

1 MNTTRLPTAF ILCCLCAAAS AADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 

101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN. 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45>: 

m007-1.seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGRAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAACT 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007-1>: 

m007-1.pep 

1 MNTTRLPTAL VLGCFCAAAS AADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 



m007-1/g007-1 91.7% identity in 133 aa overlap 

                    10        20        30        40        50        60 
m007-1.pep  MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
            |||||||||::| |:|||||||||||||||||||||||:|||||||||||| ||||:||| 
g007-1      MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 
                    10        20        30        40        50        60 

                    70        80        90       100       110       120 
m007-1.pep  FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
            :||:||:||||||||||||||||||||||||||||||||||||||||||||||||||||| 
g007-1      YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
                    70        80        90       100       110       120 

                   130 
m007-1.pep  TEKDVKQAKSKKNX 
            |||||||||:||| 
g007-1      TEKDVKQAKGKKN 
                   130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>:

a007-1.seq (partial)
1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA
151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA
201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA
251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CCACTGCCAT CAGCGATGCG
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG
351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAAC AAAAAA..


This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-1.a>:

a007-1.pep (partial)

1 MNTTRLPTAL VLGCLCAAAS AADNSIMTKG QKVYESNCVA CHGKKGEGRG
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK..

m007-1/a007-1 98.5% identity in 132 aa overlap

                    10        20        30        40        50        60
m007-1.pep  MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD
            ||||||||||||||:|||||||||||||||||||||||||||||||||||||||||||||
a007-1      MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m007-1.pep  FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a007-1      FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV
                    70        80        90       100       110       120

                   130
m007-1.pep  TEKDVKQAKSKKNX
            |||||||||:||
a007-1      TEKDVKQAKNKK
                   130



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 49>: 

g008.seq

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

401 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA

451 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>:

g008.pep

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKVVELS

151 KRLGNQGIRL LPDR* 
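
The correspondence between a DNA listing and its amino acid sequence can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and do not come from the specification. Any codon containing an ambiguity code (such as `k`, `s`, `y` or `n`) is reported as `X`, which is more conservative than the listings, where a degenerate codon is sometimes still resolved to a single residue.

```python
# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop, 'X' an unresolved codon."""
    # Remove the spaces between 10-base blocks and ignore letter case.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)


# First 30 and last 15 nucleotides of g008.seq as listed above.
print(translate("ATGAACAACA GACATTTTGC CGTCAtcgCC"))  # MNNRHFAVIA
print(translate("TTACCGGACA GGTAA"))                  # LPDR*
```

Both fragments reproduce the start and the end of g008.pep as listed.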

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

m008.seq

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA
401 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
451 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ID 52; ORF 008>: 

m008.pep

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 53>:

a008.seq

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

401 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA

451 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008.a>:

a008.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 

m008/a008 97.6% identity over a 164 aa overlap

                    10        20        30        40        50        60
m008.pep    MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a008        MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m008.pep    VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR
            ||||||||||||||||||||||||||||||||||||| ||||||||||||| ||||||||
a008        VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR
                    70        80        90       100       110       120

                   130       140       150       160
m008.pep    AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX
            ||||||||||||||||||:||||||||||||||||||||||||:|
a008        AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX
                   130       140       150       160

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF 008.ng)
from N. gonorrhoeae:

m008/g008

                    10        20        30        40        50        60
m008.pep    MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA
            |||||||||||||||:|||||:|:|||:|||||||||:|:|||||||||||||||||:||
g008        MNNRHFAVIALGSNLDNPAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m008.pep    VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR
            ||||||||||||||||||||||||||||||||||||| ||||||||||||| ||||||||
g008        VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR
                    70        80        90       100       110       120

                   130       140       150       160
m008.pep    AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX
            ||||||||||||||||||:|||:|||:||||||||||||||||||
g008        AHERSFVIRPLAEILPDFILGKYGKVVELSKRLGNQGIRLLPDRX
                   130       140       150       160



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 55>: 

g009.seq

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC

251 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK VVVAFQAVVQ AEIQVFADGG KTWQKP*
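
The listings in this section give each row as a starting position followed by blocks of ten bases. The sketch below joins such rows into one contiguous sequence and checks that every row label agrees with the running length, which also exposes a split or misread label (for example `2 01` in place of `201`). The function name `join_listing` is hypothetical; the rows are those of g009.seq above.

```python
def join_listing(lines):
    """Join numbered rows of 10-base blocks, checking each row's start label."""
    sequence = ""
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines between rows
        start, blocks = int(fields[0]), fields[1:]
        if start != len(sequence) + 1:
            raise ValueError(
                f"row labelled {start} begins at position {len(sequence) + 1}"
            )
        sequence += "".join(blocks)
    return sequence


g009_rows = [
    "1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC",
    "51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG",
    "101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA",
    "151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC",
    "201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC",
    "251 AaaaGCCATA A",
]
g009 = join_listing(g009_rows)
# 261 bases: 86 codons for the residues of g009.pep plus one stop codon.
print(len(g009), len(g009) // 3 - 1)
```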

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 57>: 

m009.seq

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK VVVAFQAVVQ AEIQVFADGG KTWQKP*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009.ng) 
from N. gonorrhoeae: 

m009/g009

                    10        20        30        40        50        60
m009.pep    MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK
            |||||||||||||||||||||||||||||||||||||||||||:||||||||| ||||||
g009        MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK
                    10        20        30        40        50        60

                    70        80
m009.pep    VVVAFQAVVQAEIQVFADGGKTWQKPX
            |||||||||||||||||||||||||||
g009        VVVAFQAVVQAEIQVFADGGKTWQKPX
                    70        80
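
The identity figure quoted for an overlap can be reproduced by counting matching positions. The sketch below assumes the two sequences are already aligned position by position without gaps, as is the case for ORF 009 and ORF 009.ng. The helper name `percent_identity` is hypothetical, and residues 61 to 86 are written as VVVAFQAVVQ AEIQVFADGG KTWQKP, which is the translation of the corresponding DNA listings.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two equal-length aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


m009 = (
    "MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK"
    "VVVAFQAVVQAEIQVFADGGKTWQKP"
)
g009 = (
    "MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK"
    "VVVAFQAVVQAEIQVFADGGKTWQKP"
)
# Two differences (positions 44 and 54) in 86 residues.
print(round(percent_identity(m009, g009), 1))  # 97.7
```

The result agrees with the 97.7% identity over an 86 aa overlap stated for ORF 009.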



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 59>:

a009.seq

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

III HUgccSa a° GGAAATAC agg ttttcgc TGATGGCGGC AAAACGTGGC 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009.a>:

a009.pep

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV
51 QLPLVAFSDK VVVAFQAVLQ AEIQVFADGG KTWQKP*

m009/a009 97.7% identity over a 86 aa overlap

                    10        20        30        40        50        60
m009.pep    MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK
            ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||
a009        MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPLVAFSDK
                    10        20        30        40        50        60

                    70        80
m009.pep    VVVAFQAVVQAEIQVFADGGKTWQKPX
            ||||||||:||||||||||||||||||
a009        VVVAFQAVLQAEIQVFADGGKTWQKPX
                    70        80



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61>:

g010.seq

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG

401 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc

801 cgacggcgaA cgcTTTATGG AAcgctatgc GCcgACCGta aAagaCTTGG

851 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGATGGAAAT CTATGAAGGT

901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAtAt

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTcc ggttgTGCCG

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga



This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>:

g010.pep






1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP

351 TTHYMMGGIP TNYHGEVVVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG

401 TNSLLDLVVF RPTPR*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 63>: 

m010.seq (partial)

1 ..nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGGACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

401 ATACGCAATT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA

451 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA 

This corresponds to the amino acid sequence <SEQ ID 64; ORF 010>: 

m010.pep (partial)

1 ..XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE

151 NGDVVGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL

201 GICARAGIPL EDMEFWQFQP TGVAGAGVLI TE...

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

a010.seq

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

451 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT
1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA
1401 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC
1451 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC
1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA
1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG
1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA
1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA
1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA
1751 AGCGCGTTTA TTGA



This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>:

a010.pep

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ
51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE
101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY
151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF
201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG
251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG
301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP
351 TTHYMMGGIP TNYHGEVVVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG
401 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT
451 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD
501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE
551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY*



m010/a010 98.7% identity over a 231 aa overlap

                                           10        20        30
m010.pep                           XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV
                                    |||||||||||||||||||||||||||||||| |||
a010        MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV
                    10        20        30        40        50        60

             40        50        60        70        80        90
m010.pep    QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a010        QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
                    70        80        90       100       110       120

            100       110       120       130       140       150
m010.pep    GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
            ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
a010        GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
                   130       140       150       160       170       180

            160       170       180       190       200       210
m010.pep    TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a010        TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
                   190       200       210       220       230       240

            220       230
m010.pep    FQPTGVAGAGVLITE
            |:|||||||||||||
a010        FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG
                   250       260       270       280       290       300



Computer analysis of this amino acid sequence gave the following results: 



Homology with a predicted ORF from N. gonorrhoeae

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF 010.ng)
from N. gonorrhoeae: 

m010/g010

                                           10        20        30
m010.pep                           XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV
                                    |||||||||||||||||||||||||||||||| |||
g010        MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV
                    10        20        30        40        50        60

             40        50        60        70        80        90
m010.pep    QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010        QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
                    70        80        90       100       110       120

            100       110       120       130       140       150
m010.pep    GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
            ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
g010        GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
                   130       140       150       160       170       180

            160       170       180       190       200       210
m010.pep    TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010        TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
                   190       200       210       220       230       240

            220       230
m010.pep    FQPTGVAGAGVLITE
            |:|||||||||||||
g010        FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG
                   250       260       270       280       290       300



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 67>: 

g010-1.seq

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG
351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG

401 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC
451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA
501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 
551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 
601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 
651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 
701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC
751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 
801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 
851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 
901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 
951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 



1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-1.ng>:

g010-1.pep

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP

351 TTHYMMGGIP TNYHGEVVVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG

401 TNSLLDLVVF RPTPR*

g010-1 / P10444

sp|P10444|DHSA_ECOLI SUCCINATE DEHYDROGENASE FLAVOPROTEIN SUBUNIT
gnl|PID|dl015210 (D90711) Succinate dehydrogenase fl avm r«^^ r B v, • t . 

Identities = 191/303 (63%), Positives = 238/303 (78%) 
Query: 1 MGFPTOKFDAVIVXXXXXXX^^ 

„ . _ M PVR+ FDAV++ S+SG CA+LSKVFPTRSHTV+AQGGI + +I/3N 

Sb 3 Ot= 1 MKLPVREFDAVVIGAGGAGMRAALQISQSGQTCAIjLSKVFPTRSHTVSA^GITVALGNT 60 

0EDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYORPFG 
ED W+WHMYDTVKGSD++GDQDAIE+MC+ PEA ++ ELEHMG + PF R ++ G+IYQRPFG 

HEDNWEWHWYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGIiPFSRIjDDGRIYQRPFG 

GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 180 
JL™ R A ADRTGHA+L HTLYQQN ++ + T F EW A DL ++ +++G WG 

GQSKNFGGEQAARTAAAADRTGHALLin'IiYQQNLKNHTTIFSEWYAIjDLVKNQDGAVVGC 18 

Query: 181 TAMEMETGEVYIFHAKRVMFATGGGGRIYASSTNAYMKTGDGLGICARAG1PLEDMEFWQ 24 

+ET6EV P A+A + ATGG GRIY S+TNA++NTGDG+G+ RAG+P++DME WO 
Sb^t: lf n ^^TGEVVYFKARATVIATGGAGRIYQSTT^ 24 „ 

FHPTGVAGAGVLITEGVRGEGGILUJADGERTOERYAPTVKDIASRDWSRAMAmiYEG 300 
FHPTG+AGAGVL+TEG RGEGG LLN GERFMERYAP KDLA RDwJ^T+EI EG 

FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPHAKDIAGRDWARS 



Query: 61 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 120 

ED W+WHMYDTVKGSD++GDQDAIE+MC+ PEA++ELEHMG+PF R++ G+IYQRPF 
Sb D Ct: 61 HE»il«Wffl«|>TVKOS^^ 120 

Query: 121 

* 121 _ ... ^vM^Mniinnnoi/IW UIlHilLini Jj Yl 11 JIM I .IfHUTTTPCW.TVRT nr f nn»._. 180 

inT T Oi\JfiV r M-KH + A'l\aG GRTY CxTWa^HTOnnn .r. . ~» - . ^ 

Sbjct: 181 

" x iwdi x wAn x wruuuvGMAI RAGVPVQDMEMWQ 24 0 

Query: 241 

Sbj Ct t 241 4. iwinunuvuv x d*jl_rttj£jljLj I J^iANK_Hr?FTT?T«'MT7DV'R DM7\ vnr * /-.i-.t^*~ ,„ ~ SIMIEIREG 300 

Query: 3 01 RGC 3 03 
RGC 

Sbjct : 301 RGC 303 

Score =24 9 (115.0 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 
Identities = 53/102 (51%), Positives = 62/102 (60%)

Query: 309 HVLLKIDHIGAEKIMEKLPGIREISIQFAGXXXX^ 

H LK+DH+G E + +LPGI E+S FA T HYMMGGTPT r 

Sb j ct : 310 HAKLKI^HIXSKEVLESR^ 

Query: 369 VPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWF 410 

+V V GL+A GE AC SVHGANRLG NSLLDLWP 
Sbjct: 370 TVNEKGEDWVPGLFAVGEIACVSVHGANRLGGNSLLDLWF 411 



368 



+V V GL+A GE AC SVHGANRLG NSLLDLWP 
ID WVPGLFAVGE IACVS VHGANRLGGNSLLDLWF 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>:

m010-1.seq

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG
51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG

401 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC
451 CAACAAAACG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGTCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTACC ACGGCGAAGT 

1101 TGTCGTTCCG CAAGGTGAAG ATTACGAAGT GCCTGTAAAA GGTCTGTATG

1151 CGGCAGGTGA GTGCGCTTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACCAACTCCC TGTTGGACTT GGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCAGGTGA GTTGACCCGC CAACGTATCG AGCGTTTGGA CAACCAAACC

1351 GATGGTGAAA ACGTTGATGC ATTGCGTCGC GAACTGCAAC GCTCTGTACA

1401 ACTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC

1451 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTGATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATATCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGATGA

This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

m010-1.pep

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP

351 TTHYMMGGIP TNYHGEVVVP QGEDYEVPVK GLYAAGECAC ASVHGANRLG

401 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT

451 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DINTLSYKPV HTKPLSVEYI KPAKRVY* 

m010-1 / g010-1 99.5% identity in 410 aa overlap

                    10        20        30        40        50        60
m010-1.pep  MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m010-1.pep  QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m010-1.pep  GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m010-1.pep  TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m010-1.pep  FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG
                   250       260       270       280       290       300

                   310       320       330       340       350       360
m010-1.pep  RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPVVPTTHYMMGGIP
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g010-1      RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPVVPTTHYMMGGIP
                   310       320       330       340       350       360

                   370       380       390       400       410       420
m010-1.pep  TNYHGEVVVPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVFGKAAGDSMIK
            ||||||||||||::||||||||||||||||||||||||||||||||||||
g010-1      TNYHGEVVVPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVF
                   370       380       390       400       410

                   430       440       450       460       470       480
m010-1.pep  FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS


The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71>:

a010-1.seq

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC

451 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

1401 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC

1451 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGA

This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-1.a>:

a010-1.pep

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ
51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE
101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY
151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF
201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG
251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG
301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPVVP
351 TTHYMMGGIP TNYHGEVVVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG
401 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT
451 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD
501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE
551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY*




m010-1/a010-1 99.3% identity in 587 aa overlap

10 20 30 40 50 60 

a010-l .pep MGFPVRKFDAVT VGGGGAGLRAXLQLS KSGLNCAVLS KVF PTRSHTVAAQGG I S ASLGNV 

I M MIMM I II I Ml M III MIIMMM MM I MIMM Mil Mill , MM I 

aOl 0 - 1 MGF PVRKFDAVI VGGGGAGLRAALQLS KSGLNCAVLS KVF PTRSHTVAAQGG I S ASLGNV 

10 20 30 40 50 60 

70 80 90 100 110 120 

aOl 0 - 1 . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKI YQRPFG 

M I M I II 1 1 1 1 1 1 II I M M M II 1 1 ! II 1 1 1 II 1 1 M I M M 1 1 1 1 II I M 1 1 1 1 1 1 I 

mOl 0 - 1 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVI ELEHMGMPFDRVESGKI YQRPFG 

70 BO 90 100 110 120 

130 140 150 160 170 180 

aO 1 0 - 1 . pep GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

i i 1 1 1 f r 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 ! 1 1 1 1 1 1 1 1 1 f I f 1 1 E I I 

mO 1 0 - 1 GHTAEHGKRAVERACAVADRTGHAMIiHTL YQQNVRANTQ F FVE WTAQD L I RDENGDWGV 

130 140 150 160 170 180 

190 200 210 220 230 240 

aOlO - 1 . pep TAMEMETGEVYIFHAKAVMFATGGGGRI YASSTNAYMNTGDGLG I CARAG IPLEDMEFWQ 

1 f I M I i 1 1 1 1 M I E 1 1 1 1 i I i I i 1 i I M 1 1 f r I i 1 1 1 ! 1 1 i 1 1 1 M 1 1 1 1 1 1 1 i r 1 1 1 i 

m01O-l TAMEMETGEVYI FHAKAVMFATGGGGR I YASSTNAYMNTGDGLGI CARAG I PLEDMEFWQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

aO 1 0 - 1 . pep FH P TG VAGAG VL I TEGVRGEGG I LLNADGERFMER YAPT VKDLAS RD WSRAMAME I YEG 

1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 f 1 1 1 i 1 1 1 1 1 1 1 1 r r i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

m010-l FH PTGVAGAGVL I TEGVRG EGG I LLNADGERFMERYAPTVKDLASRD WSRAMAME I YEG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a01O-l .pep RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

M MiMMI MIIIMM I MMMM MM Mill IIIMIIIMM MM M !M M 

m010-l RG CGKNKDHVLLKI DHI GAEKI MEKLPG I RE I S I QFAG ID P I KDP I P WPTTH YMMGG I P 

310 320 330 340 350 360 

370 380 390 400 410 420 

aO 1 0 - 1 . pep TNYHGEWVPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMI K 

M MMMMIMMI Mil IIIIIMI MIMM II MMMMIMI MM M I M II 

mO 1 0 - 1 TNYHGEVWPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMI K 

370 380 390 400 410 420 



430 440 450 460 470 480 

aO 1 0 - 1 . pep FI KEQSDWKPLPANAGELTRQR I ERLDNQTDGENVDALRRELQRSVQLHAGVFRTDE I LS 

M I IIIMIIIIII IMMI IIIIIIMMII III MIIMMIMIIIMM II ! M II 

mOlO - 1 FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

430 440 450 460 470 480 

490 500 510 520 530 540 

a010-l .pep KG VREVMA I AERVKRTE I KDKS KVWNTAR I EALELDNL I E VAKATLVSAE ARKES RGAHA 

II Mill MM III II I i I MM MMI MM MM MM IMM MM II IMI I II I | 

mOlO- 1 KG VREVMA I AERVKRTE I KD KS KVWNTAR I EALELDNL I E VAKATLVS AEARKE S RGAHA 

490 500 510 520 530 540 

550 560 570 580 

aO 1 0 - 1 . pep S DDH PERDDENWMKHTLYHS DANTLS YKPVHTKPLSVE Y I KPAKRVYX 

I I Mill MM III Mill M M Mi II II MM I MM Ml M IM 
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n»010 - 1 SD DH PERDDENWMKHTL YHSD I NTLS YKPVHTKPLS VE Y I KPAKR VYX 

550 560 570 580 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>:

g011.seq

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

401 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG

451 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 

This corresponds to the amino acid sequence <SEQ ID 74; ORF 011.ng>:

g011.pep

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMVVLKTR LAGKADMGEV NKILKTVLTA *

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 75>:

m011.seq (partial)

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA

251 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

401 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG

451 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA.

This corresponds to the amino acid sequence <SEQ ID 76; ORF 011>:

m011.pep (partial)

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 011 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 011.ng)
from N. gonorrhoeae:

mOll/gOll 



10 20 30 40 50 60 

MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

"Mill Mill MINIM MINI UNI Mill MINIMUM II mm 

MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAKDQV^ 

10 20 30 40 50 60 

70 80 90 100 no 120 

IN^VKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
anil ILL M I I I I II II M II II II I I I I I | I | || I II : II II I I I I | || U I U I m Tt7? 

9011 INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKIYTEAGRQDLADK^ 



mOll . pep 
gOll 

iuOll .pep 
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mOll . pep 



gOll 



130 140 150 160 

YLPQMLSAGEIRTEVEAAVAETGAAGMADMGKVMGLLKTRLAGKA 
I I II I I I I I I I I I ! M I I I I I I I I It I I I M I I : I f I [ I I I I I 

YLPQMLSAGEIRTAVEAAVAETGAAGMADMGKVMWLKTRLAGKADMGEVNKILKTVLTA 



130 140 150 160 170 180 



gOll 



X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 77>:

g012.seq



1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 



This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012.pep



1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 79>:

m012.seq

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

451 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

501 nnnnnnnnnn nnnnnnnnnC AACACAAAAA GGCGTGATTT nTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGT TTTTCcTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTGT GCCTCTTCCC 

651 CGCTTAA 



This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012.pep



1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA*F XRFGRFLPTL LQTFFLCFGF

201 RLFLFLFLFF LMLCLFPA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 81>:

a012.seq

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCTACAA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAACAAT TTTATCCGCC ACACACGCCA TCATATAACC

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG

401 CCGCCCGGAC TTTCCAATCT AAGCAGAACC TCATCTTCAG GCTTGGCAAT

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC

651 CGCTTAA

This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>:

a012.pep

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF

201 RLFLFLFLFF LMFCLFPA*

m012/a012 64.2% identity over a 218 aa overlap

10 20 30 40 50 fin 

ra0!2 . pep ™CHFLM I QLRAVLADKLLEQLMRFLQFLSEFLFALFR I FTHKSNRALKFARRHHIHI 

a0!2 MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLF 

10 20 30 40 50 60 



70 80 90 100 HO ion 

m012.pep ^ ^T'T"??^^^^^ ^^T^^^^T*^^^' ^ ^-^^-RX^^>^X>C^>CX^X^>C^C>CXX^ 

M M II I I I | I | I M : : j M I I I i I I 1 f : | I) I M i I I I I : | | 

NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
70 80 90 100 HQ 120 



a012 



130 140 150 160 170 1 Rn 

m012 . pep ^XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNmTQGFYGVCIQIAVKIQH^GF 
130 140 150 160 170 180 



a012 



190 200 210 219 

m012 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
IMMil[iMI:|||IIIIIII||MM|f:MMII 
LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
190 200 210 



a012 



Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 012 shows 58.7% identity over a 218 aa overlap with a predicted ORF (ORF 012.ng)
from N. gonorrhoeae:

m012/g012 



mO 1 2 . pep MLARCHFLNT OT -DnTTT * IWT T i-^* — — - 4 0 50 50 



10 20 30 

„ n - 0 11 N ■'"IN II : : I I I I I I I I I J M 1 1 M I I I I I M I j I M I I I I I I I I r 

g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI
10 20 30 40 50 60

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
M II I I f I I I I I : i I I I I I I I I : I I M I : I If I I I I I I I : I I : 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : MINI 

g012 PKLRS RQT VT VNHAART FQSEQNLI FRLGNQKHRRNLMTQG FYGVC I Q I AVK I QHKKAG F 

130 140 150 160 170 180 

190 200 210 219 

m012 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I I M : I I II : I I I I I I I I I II I I : I I I I : I I I II 
g012 LRFGRFL PALLQTL FLC FG FRL FL FLFFFFLMFC L FLAX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83>: 

m012-1.seq

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CyTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 
651 CGCTTAA

This corresponds to the amino acid sequence <SEQ ID 84; ORF 012-1>:

m012-1.pep

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF

201 RLFLFLFLFF LMFCLFPA* 



m012-1/g012 91.7% identity in 218 aa overlap



10 20 30 40 50 60 

m012-l . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I II I : I : I I I I I : : M I I I I I I M E I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I i | 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

m012-l.pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
I I I I I I I I I I I I : ! I I I I I I I I : M I I |-: I I II I I I M I : I } I 1 I I I I I I I I I M I I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012-l .pep XKLRSRQTVTVNHAARTFQSEQNLI FRLGNQKHRRNLMTQG FYGVC IQIAVKIQHKKAGF 
I I I I I I M I I I I I I I I I I I I I I I I I M I I ! I I I M I I ! II E I I I 1 I II I I I I M I I I I I 
g012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF
130 140 150 160 170 180

190 200 210 219 

m012-l . pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
N I I I I M : i I M i I I I I I f | j f M I I : f | | j | | | j [ j 
9012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 85>:

a012-1.seq

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCTACAA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAACAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT AAGCAGAACC TCATCTTCAG GCTTGGCAAT

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT

501 ACARATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1.a>:

a012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 RLFLFLFLFF LMFCLFPA * 

a012-l/m012-l 97.2% identity in 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNI QLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
M ' ' ' M I ! I I I I 1 I I I J M I M I I I I | | I I I I I | | | | | | I I I I I I I | | i | | || I I 
MIARCHFLNIQLRAVIADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAOT 
•"'ll Hill I Ml ::l II II I Ml || : in, mm,, l: ,,, ||M ,,,7,,!"*^ 

NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAOT 
70 80 90 100 HO 120 

130 140 150 160 170 180 

PKLRSRQTVT VNHAART FQSKQNL I FRLGNQKHRRNLMTQGFYGVC I Q I AVKIOHKKAGF 

"I NIMIIIIM = I I I II I I I M I I I I I I M I I I I I I I Ml I I I I I MM?T 

XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
130 140 150 160 170 180 



a012-l .pep 
m012-l 

a012-l .pep 
m012-l 

a012-l.pep 
m012-l 



190 200 210 219 

a012-l.pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I M I II I I I I M | ! I I M I I I I | || 1 I I I I I I I | 
LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
190 200 210 



m012-l 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>:

g013.seq

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtc 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng>:

g013.pep

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV
51 MMLLSAAEAA AQRQHKMKAV GSRVVFIGVS PNVLKPCFLI LPLRGEKFFW
101 PKSGI* 
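
As a rough cross-check of how a peptide listing relates to its DNA listing, the following Python sketch translates a listed sequence with the standard genetic code. It is an illustration only and is not the analysis software used to prepare these listings. The function name `translate`, the handling of whitespace and mixed case, and the choice to report any codon containing an ambiguous base as X are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA listing (spaces and mixed case allowed) in frame 1.

    Stop codons are written as '*'; codons containing anything other than
    A, C, G or T (for example 'n') are written as 'X' (an assumption).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First 30 bases of g013.seq as listed above.
print(translate("aTgcctttga ccatgctgtg cagcaGGAcg"))
```

For the first row of g013.seq this prints MPLTMLCSRT, which agrees with the first ten residues of g013.pep.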

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>:

m013.seq

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 013>:

m013.pep

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV
51 MMLLSAAEAA AQKQPKTRAV GSRVVFIGVS FMFETLLLIL RSGXKIFLPN
101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013.seq

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>: 

a013.pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSRVVFIGVS LMFETLLLIL RSG*KIFLPN
101 R* 

m013/a013 97.0% identity over a 101 aa overlap

                    10        20        30        40        50        60
m013.pep    MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a013        MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA
                    10        20        30        40        50        60

                    70        80        90       100
m013.pep    AQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKIFLPNQX
            ||:|||||||||||||||||:|||||||||||||||||||:|
a013        AQRQPKTRAVGSRVVFIGVSLMFETLLLILRSGXKIFLPNRX
                    70        80        90       100
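
The identity figure quoted for an alignment can be checked by counting identical columns. The Python sketch below does this for the m013 and a013 peptides listed above, excluding the terminal stop. It is a minimal illustration, not the program that produced the reported values: the function name `percent_identity` is hypothetical, the sequences are assumed to be already aligned to equal length, and an internal stop (written X) facing another X is counted as a match.

```python
def percent_identity(a, b):
    """Percent of aligned columns holding the same residue.

    Both strings must already be aligned (equal length); '-' marks a gap
    and never counts as a match.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


# m013.pep and a013.pep as listed above, without the terminal stop.
M013 = ("MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFV"
        "MMLLSAAEAAAQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKIFLPN"
        "Q")
A013 = ("MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFV"
        "MMLLSAAEAAAQRQPKTRAVGSRVVFIGVSLMFETLLLILRSGXKIFLPN"
        "R")

print(f"{percent_identity(M013, A013):.1f}% identity over {len(M013)} aa")
```

The two peptides differ at three of 101 positions (63, 81 and 101), which is consistent with the 97.0% stated in the alignment heading.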



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 



m013/g013

10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
111,1111 I M : I : = I : I I I II I M 1 I M I I M I I I I! i I I I ! 1 | ! i | |f I M ! I 
MPLTMLCSRTCGLFIIQSDRKSGGNAVPRPSPFLPWQAMQLDSYTFCPFVMMLLSAAEAA 
10 20 30 40 50 60 

70 80 90 100 

m013.pep AQKQPKTRAVGSRVVFIGVSF-MFETLLLILR-SGXKIFLPNQX 

i I • I I : I I I I I i I I I ! I I : : : : M I i I : I | : 
9013 AQRQHKMKAVGSRVVFIGVSPNVLKPCFLILPLRGEKFFWPKSGIX 

70 80 90 100 



g013 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93>:

g015.seq

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AATCCAGAAA

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC

301 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC

351 CAAAACCAAA GTCCTGCCAT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>:

g015.pep

1 MQYLIVKYSH QIFVTITILV FNIRFFLLWK NPEKPLVGFW KALPHLNDTM

51 LLFTGLWLMK ITHFSPFNAP WLGTKILLLF AYIALGMVMM RARPRSTKFY

101 TVYLLAMCCI ACIVYLAKTK VLPF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>:

m015.seq (partial)

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACCATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 96; ORF 015>:

m015.pep (partial)

1 ..KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGTKILLLL
51 AYIALGMMMM RARPRSTKFY TVYLLAMCCV ACIVYLAKTK VLPF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>:

a015.seq

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

251 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 



351 CAAAACCAAA GTCCTGCCTT TCTGA

This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>:

a015.pep

1 MQYLIVKYSH QIFVTITILV FNIRVFXLWK NPEKPLAGFW KALPHLNDTM
51 LLFTGLWLMK ITHFSPFNAP WLGTKILLLL AYIALGMMMM RARPRSTKFY
101 TVYLLAMCCL TCIVYLAKTK VLPF*

m015/a015 96.7% identity over a 91 aa overlap

                                               10        20        30
m015.pep                               KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH
                                          | ||||||||||||||||||||||||||||
a015        LIVKYSHQIFVTITILVFNIRVFXLWKNPEKPLAGFWKALPHLNDTMLLFTGLWLMKITH
                 10        20        30        40        50        60

                 40        50        60        70        80        90
m015.pep    FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTKVLP
            ||||||||||||||||||||||||||||||||||||||||||||||::||||||||||||
a015        FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP
                 70        80        90       100       110       120

m015.pep    FX
            ||
a015        FX

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from N. gonorrhoeae: 

m015/g015 

10 20 30 

mO 15 . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I hllllMIIIIIIIIIMIIMIIIM 

g015 LIVKYSHQIFVTITILVFNIRFFLLWKNPEKPLVGFWKAIiPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

mO 1 5 . pep FSPFNAPWLGTKI LLLIAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTKVLP 

MlllllllllilllhllllllhMIIIIIIIMIilllllMhllllllllMIM 

g015 FSPFNAPWLGTKILLLFAYIALGMVl^RARPRSTKFYTVYLLAMCCIACIVYIAKTKVLP 
70 80 90 100 110 120 



m015 .pep 



g015 



FX 

I I 
FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g018.seq

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tctACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT tctTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

g018.pep

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT

51 ATYGCQHIFG NKYAFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF
101 RLV*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101>:

m018.seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 018>:

m018.pep

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT
51 AAHGCQHIFG NKYAFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV
101 RLV* 
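
The listings in this section follow a fixed layout: each row starts with the position of its first residue and holds up to 50 residues in space-separated blocks of ten. The Python sketch below reproduces that layout from a plain sequence string, which can help when comparing a listing against a sequence held elsewhere. The function name `format_listing` and the five-character width of the position column are assumptions made for this illustration.

```python
def format_listing(seq, row=50, block=10):
    """Return listing rows: start position, then blocks of `block` residues."""
    lines = []
    for start in range(0, len(seq), row):
        chunk = seq[start:start + row]
        blocks = [chunk[i:i + block] for i in range(0, len(chunk), block)]
        # Positions are 1-based, as in the listings.
        lines.append(f"{start + 1:>5} " + " ".join(blocks))
    return lines


# m018.pep as listed above, including the terminal stop.
M018 = ("MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRT"
        "AAHGCQHIFGNKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGV"
        "RLV*")

for line in format_listing(M018):
    print(line)
```

The same function applies to the DNA listings, since they use the same 50-per-row, 10-per-block convention.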

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>:

a018.seq

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>:

a018.pep

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT

51 AAYGCQHIFG NKYAFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF

101 RLV*

m018/a018 86.4% identity over a 103 aa overlap

10 20 30 40 50 60 

m018.pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDI FQTDIVNVRT AAHGCQHIFG 

1,1 '1111:1 : : I II I i I I I : I I : | i M I I I I I I I I : I I || || I I I I I I : I I I I I I I 
MQQGQL VGR VARNKDMRNTGLHSQRIGNGYAAR I FFD I DVFQTD I VNVRT AAYGCQHIFG 
10 20 30 40 50 60 



a018 

m018 .pep 
a018 



7 ° 80 90 100 

NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I I M I I I I I I I I I I I I : I I M I I I I I I I | II : I I I | M I 
NKYAFFAILLPMDFYIAVCVEFGLGFSIQMQFQFFTEHGFRLVX 

7 0 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng)
from N. gonorrhoeae:

m018/g018 



m018 .pep 



10 20 30 40 50 60 

MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRTAAHGCQHIFG 



M 1 11111:1 : : I M I : I I | | | | : | | | M I : I I I I : I I I I || I I I I I :• I I I I I I I 
9018 MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARVFVDIDVFQTDIVNVRTATYGCQHTFG 

10 20 30 40 50 60 






70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I I I I I I I I I M I I I I I : I I I I I I I i I I I I I I I : I I I I I I I 
g018 NKYAFFAILLPMDFYIAVCVEFDLGFSIQMQFQFFSEHGFRLVX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 105>:

g019.seq (partial) 

1 . .ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC ccgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 

151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 

201 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>: 

g019.pep (partial) 

1 . . LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 107>:

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

351 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT

401 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC

451 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 

901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 

951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCGGAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

1051 GGGGAAGAAT TGGGTCGGAA AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AGCCAATCTG CCGGTGATGC AAAAATGCGC 

1201 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA
1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG

1351 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT
1401 TAATGTCGAT CCGGCTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT

1451 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT
1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
1551 ACAACTTTAC ACCGCCGACG GG. . . 

This corresponds to the amino acid sequence <SEQ ID 108; ORF 019>:

m019.pep (partial) 

1 MYLPSMKHSL PLLAALVLAA CSSTNTLPAG KTPADNIETA DLSASVPTRP
51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRSFAWGVL

251 GHYQSQNLNV PAALDYYGKV ADRRQLTDDQ IEWYARAALR ARRWDELASV 

301 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

351 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

401 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 

451 RYISPFKDTV IRHAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADG.



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>:

a019.seq
ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 
GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 
CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 
GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 
ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 
AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 
AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC
NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 
CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 
AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 
CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 
GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 
TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 
CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 
TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 
GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 
NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 
ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 
ANCNCGNNNN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 
NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGANAAAC
TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN
GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC
CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 
CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 
CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 
CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 
TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 
CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 
TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 
TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 
ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 
CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 
TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 
CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 
ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 
GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 



This corresponds to the amino acid sequence <SEQ ID 110; ORF 019.a>:

a019.pep

1 MYPPSLKHSL PLLVXLVLAA CSXTNTLSAD KTPADNIETA DLSASVPTXP

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRSFAWGVL

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR

401 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL

451 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESRFVMGAQS RVGAQGLMQV

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR* 

m019/a019 88.9% identity over a 524 aa overlap

10 20 30 40 50 60 

m019.pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD

II I I : I I I If M : I I I I I I I I I I I I I I I I I I M I I II II I M I Mill t I II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADNIETADLSASVPTXPAEPEXKTXAD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m019.pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE
I I i I I M I M II N 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 80 90 100 110 120 

130 140 150 160 170 180 

m019.pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG

M I I I I I I I I I I I I M I I M II I I I I II I i I M II I | | M | ] | | | | | | M Ml 

a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m019.pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL

I II I I I I I I I M I I I I I I I I II M I I M I I ! I I I I I i I I I I | M I M I I I II I I I I I I | | 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL

190 200 210 220 230 240 

250 260 270 280 290 300 

m019.pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV

I I I M I I I I I I I I I I I I I I I I I I I I I 11 I I I I ! I I I I | | | M I I I I I I I 
a019 EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 

250 260 270 280 290 300 

310 320 330 340 350 360 

m019.pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT

I : 1 I I I I II I I I I I I M I I I I I : I I I I I I I I I I I I I Ml 

a019 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT

310 320 330 340 350 360 

370 380 390 400 410 420 

m019.pep RNNVPDAGKNSVRRMAEDGAVKRALVLFQNSQSAGDAKMRRQAQAEWRFATRGFDEDKLL

I I I I I I I I I M 1 If M I I : I I I II I I : M : : I I I i M I I I I M I I I I I I I I I I I I M 
a019 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAGDAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m019.pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISPFKDTVIRHAQNVNVDPAWVYGLIRQ

MINIM I I I I I | | | M I M I I I I I I I I I M I II 1 I I I I I I I I I II I II I I 

a019 TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISXXXDTVIRHAQNVNVDPAWVYGLIRQ

430 440 450 460 470 480 

490 500 510 520 

m019.pep ESRFVIGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADG
I M I I : I I I I I I M II I I I M I I I I I I I I I I i I I M M I I I I I I 
a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 

490 500 510 520 530 540 

a019 QNNEVLATAGYNAGPGRARRWQADTPLEGAVYAETIPFSETRDYVKKVMANAAYYASLFG

550 560 570 580 590 600 
Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng)
from N. gonorrhoeae:
g019/m019

10 20 30 40 49

g019.pep LLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD

m019 MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD

10 20 30 40 50 60

50 60 70 80 89

g019.pep YGGYPSALDAVKQNNDAAAAAYLENAGDSAMAENVRKEWL

1 1 1 1 1 II II M M = i 1 1 h I M I M | I I I M II M I : I II

m019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE

70 80 90 100 110 120

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 111>:

g023.seq

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtCtGGCTG

301 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA

This corresponds to the amino acid sequence <SEQ ID 112; ORF 023.ng>:

g023.pep

1 MVERKLTGAH YGLRDWVMQR ATAVIMLIYT VALLVVLFAL PKEYPAWQAF

51 FSQAWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVRL FLQVATIVWL

101 VGCLVYSVKV IWG*
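
The correspondence between a nucleotide listing and its peptide listing can be checked by translating the DNA in frame with the standard genetic code. The following Python sketch is illustrative only and is not part of the original analysis; the function name `translate` and the choice to report any codon containing an ambiguous base as X are assumptions made here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Whitespace (the 10-base block spacing of the listings) is ignored.
    Codons containing an ambiguous base such as N are reported as 'X'
    (an assumption of this sketch).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of g023.seq as listed above.
    print(translate("ATGGTAGAAC GTAAATTGAC CGGTGCCCAT"))
```

Applied to the first 30 nucleotides of g023.seq, the sketch returns MVERKLTGAH, the first ten residues of g023.pep.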

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 113>:

m023.seq

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

301 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA

This corresponds to the amino acid sequence <SEQ ID 114; ORF 023>:

m023.pep

1 MVERKLTGAH YGLRDWVMQR ATAVIMLIYT VALLVVLFSL PKEYSAWQAF

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVRL FLQVATIVWL

101 VGCLVYSVKV IWG*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 115>:

a023.seq

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 
201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 116; ORF 023.a>:

a023.pep

1 MVERKLTGAH YGLRDWAMQR ATAVIMLIYT VALLVVLFAL PKEYSAWQAF

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYXKPFGVRL FLQVATIVWL

101 VGCLVYSIKV IWG*



m023/a023 96.5% identity over a 113 aa overlap



10 20 30 40 50 60 

m023.pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT
I 1 I I I I M I I ! I I I I I : I I El I I I I I I I I I I II I II I I : I I I II I I 1 I I M I I I j I M I I 
a023 MVERKLTGAHYGLRDWAMQRATAVIMLIYTVALLVVLFALPKEYSAWQAFFSQTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

m023.pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX

M I I I I I II I I I I I M I I I I I I I I I I I I I I i I I I I I I I I I I I I i II : I I I I M 
a023 QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 

70 80 90 100 110 
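
A figure such as "96.5% identity over a 113 aa overlap" is the share of aligned columns in which both sequences carry the same residue. A minimal Python sketch of that calculation follows; the function name `percent_identity` is hypothetical, and the treatment of gap characters (counted in the overlap length, never as matches) is an assumption, since the settings of the alignment program are not stated here.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns holding identical residues.

    Both inputs must already be aligned to the same length.
    Columns containing a gap count toward the overlap length but
    never as matches (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # First 20 columns of the m023/a023 alignment shown above.
    m023 = "MVERKLTGAHYGLRDWVMQR"
    a023 = "MVERKLTGAHYGLRDWAMQR"
    print(round(percent_identity(m023, a023), 1))
```

For the first 20 columns of the m023/a023 alignment the sketch gives 95.0, reflecting the single V/A difference at position 17.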



Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 023 shows 97.3% identity over a 113 aa overlap with a predicted ORF (ORF 023.ng)
from N. gonorrhoeae:

g023/m023 

10 20 30 40 50 60 

g023 . pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFALPKEYPAWQAFFSQAWVKVFT 

Ml Ml M : M M M I II M 1 1! I MM I M MMI M MUM I I II 1 1 ! II = 1 1 1 1 1 i 

m023 MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT

10 20 30 40 50 60 



70 80 90 100 110 

g023.pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX

II M I! IM M ! M Ml I I I Ml Ml M II I MM II I MM! II i M Ml I I i 

m023 QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 117>:

g025.seq

1 ATGTTGAAAC AAAcgACACT TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 

51 GTTGGGCGGT TGcgCCACCC AACAGCCTGC TccTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTATCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 

151 ACGCCGTACA ATGCCGCTCC TGCCGCCAac gatgcGCCgT ATGTGCCGCC 

201 CGTGCAAact gcgccggttT ATTCGCCTCC TGCTTATGTT CCGCcgtCTG
251 CACCTGCCGT TTCGGgtaca tatgtTCCTT CTTACGCACC CgtcgACATC
301 aacgCGGCGa cgCataCTAT TGTGCGTGGC GACACgGtgt acaACATTTc
351 caaAcgCtac CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA
401 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCaggA
451 TATGCCGCAC CGAAAACCGC AGCCGTAGAA AGCAGGCCCG CCGTACCGGC
501 TGCCGCGCAA ACCCCTGTGA AACCCGCCGC gcaACCGCCC GTTCAGTCCG
551 CGCCGCAACC TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC
601 GCGCCCGCCC CGCAATCTCC TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT
651 CGGCGGCATT GTTTGGCAGC GTCCGACCCA AGGTAAAGTG GTTGCCGATT
701 TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC
751 GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG
801 GGGATACGGA AACTTGGTCA TCATCCAGCA CAATTCCTCT TTCCTGACCG
851 CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA
901 CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA
951 GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT
1001 ATATCGCGTT CTGA

This corresponds to the amino acid sequence <SEQ ID 118; ORF 025.ng>:

g025.pep

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP

201 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VADFGGGNKG VDIAGNAGQP

251 VLAAADGKVV YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK

301 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 119>:

m025.seq (partial)

1 ..GTGCCGCCGG TGCAAAGCGC GCCGGTTTAT ACGCCTCCTG CTTATGTTCC

51 GCCGTCTGCA CCTGCCGTTT CGGGTACATA CGTTCCTTCT TACGCACCCG 

101 TCGACATCAA CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC 

151 AACATTTCCA AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA 

201 CGGCATGACC GACAATACGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC
251 CGGCAGGATA TGCCGCACCG AAAGCCGCAG CCGTAAAAAG CAGGCCCGCC
301 GTACCGGCTG CCGCGCAACC GCCCGTACAG TCCGCACCCG TCGACATTAA
351 CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC AACATTTCCA
401 AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA CGGCATGACC
451 GACAATATGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC CGGCAGGATA
501 TGCCGCACCG AAAACCGCAG CCGTAGAAAG CAGGCCCGCC GTACCGGCTG
551 CCGTGCAAAC CCCTGTGAAA CCCGCCGCGC AACCGCCTGT GCAGTCCGCG
601 CCGCAACCTG CCGCGCCCGC TGCGGAAAAT AAAGCGGTTC CCGCGCCCGC
651 CCCGCAATCT CCTGCCGCTT CGCCTTCCGG CACGCGTTCG GTCGGCGGCA 
701 TTGTTTGGCA GCGTCCGACG CAAGGTAAAG TGGTTGCCGA TTTCGGCGGC 
751 AACAACAAGG GTGTCGATAT TGCCGGTAAT GCGGGACAGC CCGTTTTGGC 
801 GGCGGCTGAC GGCAAAGTGG TTTATGCCGG TTCAGGTTTG AGGGGATACG 
851 GAAACTTGGT CATCATCCAG CATAATTCTT CTTTCCTGAC CGCATACGGG 
901 CACAACCAAA AATTGCTGGT CGGCGAGGGG CAGCAGGTCA AACGCGGTCA 
951 GCAGGTTGCT TTGATGGGCA ATACCGATGC TTCCAGAACG CAGCTTCATT 

1001 TCGAGGTGCG TCAAAACGGC AAACCGGTTA ACCCGAACAG CTATATCGCG 

1051 TTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>:

m025.pep (partial)

1 ..VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKVVADFGG

251 NNKGVDIAGN AGQPVLAAAD GKVVYAGSGL RGYGNLVIIQ HNSSFLTAYG

301 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA

351 F*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>:

a025.seq

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
151 ACGCCGTACA ATGCCGCTCC TGCCGCCAAC GATGCGCCGT ATGTGCCGCC
201 GGTGCAAAGC GCGCCGGTTT ATANGCCTCC TGCTTATGTT CCGCCGTCTG
251 CACCTGCCGT TTCGGGTACA TACGTTCCTT CTTACGCANC CGTCGACATC
301 AACGCGGCGA CCCATACTAT TGTGCGCGGC GACACCGTGT ACAAGATTTC
351 CAAATGCTAC CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA
401 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCAGGA
451 TATGCCGCAC CGAAAGCCGC AGCCGTAAAA AGCAGGCCCG CCGTACCGGC
501 TGCCGCGCAA CCGCTCGTAC AGTCCGCACC CGTCGACATC AACGCGGCGA
551 CGCATACTAT TGTGCGCGGC GACACGGTGT ACAACATTTC CAAACGCTAC
601 CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA CCGACAATAC
651 GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCAGGA TATGCCGCAC
701 CGAAAGCCGC AGCCGTAAAA AGCAGGCCCG CCGTACCGGC TGCCGTGCAA
751 ACCCCTGTGA AACCCGCCGC GCAACCGCCT GTGCAGTCCG CGCCGCAACC
801 TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCGCCC GCCCCGCAAT
851 CTCCTGCCGC TTCGCCTTCC GGCACGCGTT CGGTCGGCGG CATTGTTTGG
901 CAGCGTCCGA CGCAAGGTAA AGTGGTTGCC GATTTCGGCG GCAACAACAA
951 GGGTGTCGAT ATTGCAGGAA ATGCGGGACA GCCCGTTTTG GCGGCGGCTG
1001 ACGGCAAAGT GGTTTATGCA GGTTCCGGTT TGAGGGGATA CGGCAATTTG
1051 GTCATCATCC AGCATAATTC TTCCTTCCTG ACCGCATACG GGCACAACCA
1101 AAAATTGCTG GTCGGCGAAG GCCAGCAGGT CAAACGCGGG CAGCAGGTCG
1151 CTTTGATGGG CAATACCGAG GCTTCTAGAA CGCAGCTTCA TTTCGAGGTG
1201 CGGCAAAACG GCAAACCGGT TAATCCGAAC AGCTATATCG CGTTCTGA



This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 025.a>:

a025.pep

1 MLTPTTL*VA CTALAAQLGG CPTQHPSPVI AGNSGMQTVP SAPVYNPYGA

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT YVPSYAXVDI

101 NAATHTIVRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHTIVRG DTVYNISKRY

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK SRPAVPAAVQ

251 TPVKPAAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS GTRSVGGIVW

301 QRPTQGKVVA DFGGNNKGVD IAGNAGQPVL AAADGKVVYA GSGLRGYGNL

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE ASRTQLHFEV

401 RQNGKPVNPN SYIAF*



m025/a025 97.4% identity over a 351 aa overlap



10 20 30 

m025 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I I I If I M I I : I I M M i M I I I It I I I I I 
a025 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 



40 50 60 70 80 90 

m025 . pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
II I I I I I I I I I I I I I I I II : I I I I I I I I I I I M I II I I I I I I I I I I I I I I I M I I I I I 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 



100 110 120 130 140 150 

m025.pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT

I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I M I | | | | | | | | | | 
a025 KAAAVKSRPAVPAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 

160 170 180 190 200 210 

160 170 180 190 200 210 

m025 . pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

II I M I I I I I I M I I I t I I I : I I I : I I I I I I I I 1 1 I I I IE II I I I I I I E I I I I I I I I | | 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

220 230 240 250 260 270 



220 230 240 250 260 270 

m025.pep KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKVVADFGGNNKGVDIAGNAGQPVLAAAD
M I I I M I I I I I I I I I I I I I I I I I I I I I I i I I t I I I I I I I I I ! I I I I I I I ( I | | | | i| | | 
a025 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKVVADFGGNNKGVDIAGNAGQPVLAAAD
280 290 300 310 320 330 



280 290 300 310 320 330 

m025.pep GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDASRT 
11 M I I I I 1 ! I [ i I M I I j 1 I 1 I | | j j | | 1 j | | M I M M I j M ! M I t I j I I M : | | 1 t 
a025 GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTEASRT
340 350 360 370 380 390 



340 350 
m025.pep QLHFEVRQNGKPVNPNSYIAFX
i I I M I f I I I I I | | I | f | | | | |
a025 QLHFEVRQNGKPVNPNSYIAFX

400 410 



Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025.ng) 
from N. gonorrhoeae: 

m025/g025 



m025 .pep 
g025 



10 20 30 

VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVQ^ 

40 50 60 70 80 



90 



m025 .pep 
g025 

m025 .pep 
g025 



40 50 60 70 80 90 

YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 

NNIMIIIIIMMMIIIMIIIIMIIIMIIIIIMMIIIIIIIIIIIIIIN! 

YAPVD INAATHTI VRGDTVYNI S KRYHI S QDDFRAWNGMTDNTLS I GQ I VKVKPAG YAAP 
1Q 0 110 120 130 140 



150 



100 



110 



120 130 140 150 

KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTI VRGDTVYNI S KRYHI SQDDFRAWNGMT 



m025 .pep 
g025 

m025 .pep 
g025 

m025 .pep 
g025 

m025 .pep 



160 170 180 190 200 210 

DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

NIIMMIMIhllllllMllllllllliiiiiiii 

TAAVESRPAVPAAAQTPVKPAAQPPVQSAPQPAAPAAEN 



160 



170 



180 



190 



220 230 240 250 260 

KAVPAPAP - - QSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGOPVLAA 

1 1 f 1 1 1 1 1 !lllillMMIIIIIMMIIIIMIIIIM:|||||||IM||||TTl 
kavpapapapqspaaspsgtrsvggiwqrptqgkwadfgggnkgvdiagnagqpviLaa 

200 210 220 230 



240 



250 



270 



280 



290 300 310 320 

adgkwyagsglrgygnlviiqhnssfltayghnqkllvgegqqvkrgqqvalmgntdas 

f i i i i i » i f i i i f f f i i i t i r i i i i j i i i f i i i t m i t i ( i i i i i i j i ? i i ! i i 7TTTTT 

adgkwyagsglrgygnlviiqhnssfltayghnqkl^^ 

300 310 



260 



270 



280 



290 



330 340 350 

rtqlhfevrqngkpvnpnsyiafx 
I M II I 1 1 M Mill I III II II I

g 0 2 5 RTQLHFEVRQNGKPVNPNS YI AFX 

320 330 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 13>: 

g031.seq

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

401 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT

451 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CGTACAGACC TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttagtaa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 031.ng>:

g031.pep

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLVVPV LH*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

m031.seq (partial) 

1 ... CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 
51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 
101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 
151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 
201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 
251 AG 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 031>:

m031.pep (partial)

1 ...RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP
51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ...

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

a031.seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 
51 CTGCCGGCGG ACGCGCGACC AGCATCAGGA ACGCCGCAAT CGCCAAGGCT

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 
201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 031.a>: 

a031.pep (partial) 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQGFRRPVQH VGRRNQQQRH 
51 SQTCGQSGRN HAQKQQCATR Q 
m031/a031 100.0% identity over a 71 aa overlap

10 20 30 40 50 60 

m031.pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

i'MIMMIMMIIMMIIMiMIHIMiiMMHIMlll 
a031 IRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ

10 20 30 40 

70 80 
m031 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
t i I I I M 1 I I I I | | M | | | | | | | | 
a031 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 031 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 031.ng)
from N. gonorrhoeae: 

m031/g031 



m0 31 .pep 
g031 



10 20 30 

RLKHGVGLHFYSAIRLFTQAVIEFPQTAEH 



NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTO 

60 7 ° 80 90 100 HQ 



40 50 60 70 80 

m0 3 1 .pep CRRTRDQHQERRNRQGFRRPVQHVGRRNQQQRHS-QTCGQSGRNHAQKOOCATRO 
■ = ""'»» U I I II I I I I | | | | : | MM : I - ■ - ' 



g031 
g031 



1 • 1 • i i i i i I i i j i i i i | | | j | | : | MM : I : I : | : : : : : - | . j . 

CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQPSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 

TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQTFRTHLQFPINVIAVSRVKMFFTS 
180 190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 19>:

g032.seq

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG

451 CCCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc ctgcaTCTTC 

601 AGACGGCATT TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA

751 CAAATATCAA AAAACCAGTG A 



This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 032.ng>:

g032.pep

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF
51 QGFHAFAGQR NLTLLAPFAG NVYPRFVQIY IICIQAVYLA HAQTAAVHQL
101 EQRVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ
151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF
201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT
251 QISKNQ*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 



m032.seq (partial)

1 ATGCGGCGAA
51 AACGTTTTTG
101 AACAGGGCTT
151 CAGGGCTTCC
201 CTTTGCCGAT
251 TTCAAGCCGT
301 GAACAGGGCG
351 AATCCAGCAT
401 TCGGGCTGCT
451 GCCGCGCTTT
501 GAGCCAGCGT

This corresponds to the amino acid sequence <SEQ ID 112; ORF 032>: 

m032.pep (partial)

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF

51 QGFHAFADQR HLPLXAPFAD NVYPRXVQID IICIQAVYLA HAQTAAVHQF

101 EQGVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ

151 AALYQPNAIL PPRRKLASQR PFPQTA...

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

a032.seq

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG

451 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA

751 CAAATATCAA AAAAACAGTG A

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 032.a>:

a032.pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLPLLASFAG NVYPRLVQIY IICIQAVYLA HAQTAAVHQF

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 

m032/a032 88.1% identity over a 176 aa overlap

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
Mill I I I I I I I I I I I I I I I I I M I I I I I I I | | | | | | | | | | M | | | | | || | | i || || 
a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR

10 20 30 40 50 60 
70 80 90 100 110 120

m032.pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH
: 11 1 1 11 1 1 1 J I HI I I M I I I I i M M I i I I I I M i I : If | | | | | j| I I I I I I

a032 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH
70 80 90 100 110 120

130 140 150 160 170

m032.pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA
1 I 1 I" I I I I I i I II M I M M M M I I : : I : f : I I : [ I | I I I I : || III Ml

a032 PVQPFLRQGFGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP
130 140 150 160 170 180

a032 PQRRRKIALRQALRHAACIFRRHLCQQRKQFFQIAPVCRHRVLRLALAHDVFQISVKMRR
190 200 210 220 230 240

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng)
from N. gonorrhoeae:

m032/g032

10 20 30 40 50 60

m032.pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR
11,11 M 11 "I : : I I I M I II I I | M | M I II I I N I I I I I | | | | | I I M I! 11

g032 MRRNVPAVAVLRRPRFEAFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR
10 20 30 40 50 60

70 80 90 100 110 120

m032.pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH
: * 1 I''' ""I Ml MMMMIIMIIIMII:!! I I I I I | I | || I I I I I M
g032 NLTLLAPFAGNVYPRFVQIYIICIQAVYLAHAQTAAVHQLEQRVVAHRQRVAAVHGQIQH
70 80 90 100 110 120

130 140 150 160 170

m032.pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA
inHflllHIIIIIIMMMMM:!! |: M:|||MM:|| ||| Ml

g032 PVQPFLRQGFGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP
130 140 150 160 170 180

g032 PQRRRKIAPRQVLRHAACIFRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQISVKIRR
190 200 210 220 230 240



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 15>:

g033.seq

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCGCCAT
51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT
101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA
151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA
201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAAcgg
251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGagtTTGC CCAAAAAGTC
301 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC
351 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG
401 GACACAACGT CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC
451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA
501 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta
551 AAGAAGGCGG GGCGCAAATg ccGTCTGAAA AAGAACCCAA GCCCGCCgCc
601 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG
701 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC

751 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

801 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 

951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG

1051 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT TCGGCAGTAT GGTCGCCACC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTt 

1251 cgtcaaacCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

1301 accGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC

1351 GCGGTCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT
1401 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGATCCG AAAAAACTTT 

1451 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG
1501 TGGCTGCCGG ACCGTGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 116; ORF 033.ng>: 

g033.pep

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLVVLNDNE

51 MSISPNVGAL PKYLASNVVR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG

251 IAEQHAVTFA GGLACEGMKP VVAIYSTFLQ RAYDQLVHDI ALQNLPVLFA

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGEKTA FIAFGSMVAT

401 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG

451 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE

501 WLPDRDAAN*
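
The numbered rows used throughout these listings (a 1-based start position followed by groups of ten residues) can be joined back into one contiguous sequence mechanically. The following is a minimal sketch in Python and is not part of the original disclosure. The function name `join_listing_rows` is hypothetical, and the two example rows are the final rows of g033.seq.

```python
import re


def join_listing_rows(rows):
    """Concatenate numbered listing rows into one sequence string.

    Each row is expected to look like '1451 TGGACGATTT GGGTTTGAGT ...':
    a 1-based start position followed by space-separated residue groups.
    Blank rows are skipped. A row whose start position does not follow on
    from the previous row raises ValueError.
    """
    pieces = []
    next_pos = None
    for row in rows:
        match = re.match(r"\s*(\d+)\s+(.*\S)\s*$", row)
        if match is None:
            continue  # blank or unnumbered line
        start = int(match.group(1))
        residues = match.group(2).replace(" ", "")
        if next_pos is not None and start != next_pos:
            raise ValueError(f"row starts at {start}, expected {next_pos}")
        pieces.append(residues)
        next_pos = start + len(residues)
    return "".join(pieces)


rows = [
    "1451 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG",
    "1501 TGGCTGCCGG ACCGTGATGC GGCAAATTAA",
]
tail = join_listing_rows(rows)
print(len(tail), tail[-3:])  # 80 TAA
```

The position check is a design choice for catching dropped or misnumbered rows; it assumes every row is complete, so it would reject a listing in which columns are missing.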

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 117>:

m033.seq

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG.CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC
351 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG

401 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC
451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 
601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC
751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
951 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC

1351 GCGGTGCTGG AAGTATTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT

1401 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA



This corresponds to the amino acid sequence <SEQ ID 118; ORF 033>: 

m033.pep

1 MAAADKLLGS DRRSVAIIGD GAMTAGQAFE ALNCAXDMDV DLLVVLNDNE

51 MSISPNVGAL PKYLASNVVR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG

251 IAEQHAVTFA GGLACEGMKP VVAIYSTFLQ RAYDQLVHDI ALQNLPVLFA

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGEKTA FIAFGSMVAP

401 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA

501 WLSDRDAAN*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 119>:

a033.seq

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

401 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC

451 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA

501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC

751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

1401 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>:

a033.pep

1 MAAADKQLGS DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLVVLNDNE

51 MSISPNVGAL PKYLASNVVR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG

251 IAEQHAVTFA GGLACEGMKP VVAIYSTFLQ RAYDQLVHDI ALQNLPVLFA

301 VDRAGIVGAD GPTHAGLYDL SFLRCIPNMI VAAPSDENEC RLLLSTCYQA

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG IIRREGEKTA FIAFGSMVAP

401 ALAVAGKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGS

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA

501 WLSDRDAAN*

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033.pep MAAADKLLGSDRRSVAIIGDGAMTAGQAFEALNCAXDMDVDLLVVLNDNEMSISPNVGAL
I I I I I I I I I M I I I I I I I I I I I I I I I I i I I I I I I I I I M I I I I I I I | | | | M I I I I II 
a033 MAAADKQLGSDRRSVAIIGDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMSISPNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033.pep PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL
I I I I I I M I I I I I I I! I I I I I M M I I I I I I I I I I f I I I M 1 I I I I I I I I I I I I i | | | | | 
a033 PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I I I I I I I I M I I I I I M I I I ! I I I : I I I : I I I M i II I I I I I II II I I I I I I I I I I I I I I 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m033 . pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
I I I I I I I I N I I I I I I M I I I I I I I I I I I I I I I I I I I I I I ( I I I M I II I M I I I I I I I I 
a033 NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033.pep RFPDRYFDVGIAEQHAVTFAGGLACEGMKPVVAIYSTFLQRAYDQLVHDIALQNLPVLFA
M I I II j I I I I M I I I I I ! I I I I 11 I I I II I I I 1 M I I I M II I I I I I I I I I I I I I I I I I 
a033 RFPDRYFDVGIAEQHAVTFAGGLACEGMKPVVAIYSTFLQRAYDQLVHDIALQNLPVLFA

250 260 270 280 290 300 

310 320 330 340 350 360 

m033.pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR
I I I I M I I I I I M I I I I II I I I I I I : 1 I I i I I I I I I I M 1 I I M I I I 11 I I I M I I II I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR
310 320 330 340 350 360 

370 380 390 400 410 420 

m033.pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP
I I I I I : I I I I I I I I I M I I I I M I I M M I I I I M I I I 11 I M I I I I I I I I I II I I I I I 
a033 GTGTGVPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

I I I i I I I I I I I I I I I M I I I I M I I I I I I : I I I I M I I I 1 I I II I I I I i I I I M I 

a033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 . pep KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
I I I I I I I I I II I 1 I I I I I I I M I I I II I I I 
a033 KKLLDDLGLSAEAVERRVRAWLSDRDAANX

490 500 . 510 



Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng)
from N. gonorrhoeae:



m033/g033 






m033 


-Pep 


MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMS I SPNVGAL 

1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | IIIMIIIIIIMlllliiiMM 

MAAADKLLGGDRRSVAI I GDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMS I S PNVGAL 


60 


g033 




6 0 


m033 


-pep 


PKYIiASNWRDMHGLLSTVPCAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKOSLSL 

'J" ^ ' M 1 1 1 M 1 1 1 1 M 1 1 1 1 i 1 1 1 [ M 1 1 1 ! t M i 1 1 1 1 1 i 1 1 1 ! f 1 1 f f 1 1 1 ??tT? 


120 


g033 




PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTI^^ 


J. ^ u 


m033 


-Pep 


FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 




9033 




''NIIIMMMIMIIMIIIMIMIIMIIIIMIIMIIiMIIIMMIIMII 

FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLH^ 


"ton 
J. o V 


m033 


•pep 


NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEn 

IIII|::|MM| HIIIIMIIIMIIIIMIIIIMMIIMIMI?MII?MIN| 


24 0 


g033 




NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 


24 0 


m033 . 


.pep 


RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAVDQIjVHDIALONLPVLFA 

''NMIMIIMIIIIIIMMMINIiMIIIIMIIIMIIIIIimymMM 


T A A 


g033 




RFPDRYFDVGIAEQHAVTFAGGLACEGMKPVVAIYSTFLORAYDOLVHDTAT ONTT pv IL 


300 


tn033 . 


pep 


v UKAb 1 v fcrADGP THAGL YDLS FLR CVPNM I VAAPSDENE CRLLL S TC YO ADAP A A VP V P p 

NMIilliiMIIIIIMilUIIIIIIIIIIMUlM 

VDP^GIVGADGPTHAGLYDLSFLRCVPmiVAAPSDENECRLLLSTCYQADAPAATOYPR 


360 


g033 




360 


m033. 


pep 


GTGTGAPVSDGMETVE IGKGI I RREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 


420 


g033 




M Mill MM IIIMIIIMIIMIII MINIMI I | II 1 II 1 1 II 1 II 1 1 1 II 1 1 1 

GTGTGAPVSDGI^WEIGKGIIRREGEKTAFIAFGSMVATALAVAEKLNATOADM 


420 


m033 . 


pep 


IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVIAKHGI^ 


480 


g033 




480 


m033.; 


pep 


KKLLDDLGLSAEAVERRVRAWLSDRDAANX 510 




g033 




MIMMMIMMIIIII 1! IMIMI 

KKLLDDLGLSAEAVERRVREWLPDRDAANX 510 
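
The identity figures quoted for each alignment can be checked by counting matching columns. The following is a minimal sketch, assuming that identity is the number of identical residue pairs divided by the number of aligned, ungapped columns; the convention of the alignment program actually used is not stated in the text, and `percent_identity` is a hypothetical helper. The example uses only the final block of the m033/g033 alignment (residues 481-509, with the terminator X omitted), so the result applies to that block and not to the 98.4% reported over the full 509 aa overlap.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns that belong to the overlap (no gap on either side).
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if a != gap and b != gap]
    if not columns:
        raise ValueError("no overlapping columns")
    identical = sum(1 for a, b in columns if a == b)
    return 100.0 * identical / len(columns)


# Final alignment block, residues 481-509 of ORF 033 and ORF 033.ng.
m033_tail = "KKLLDDLGLSAEAVERRVRAWLSDRDAAN"
g033_tail = "KKLLDDLGLSAEAVERRVREWLPDRDAAN"

print(round(percent_identity(m033_tail, g033_tail), 1))  # 93.1 (27 of 29)
```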





The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121>:

g034.seq

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGACCA CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGcggGCG 

251 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA

351 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GGCTCTTTGC

401 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC
451 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA
501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAGCAGGC GAAGAAGACG
551 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT
601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG
701 GCGACGTATT GCGTATCGAC CGCATCAAGG AAATCCACCA AGCCCTGCCC

751 AATACACACA TCGTGATGCA CGgctCCAGC TCCGTTCCGC AAGAatgGCT 

801 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 

901 ATCGATACCG ACCTGCGCCT CGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TTGATCCGCG CAAATACTTG GGCAAAACCA 

1001 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GTTATCTTGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCTGTT TCGTTGGAAA AAATGGCAAG 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>:

g034.pep

1 MSRLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAAE NSYGLPAFNV

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLLEDGKTP SSYEYNVNAT

151 RTVVNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAAGK LSHDQMLTSV

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL GKTIEAMKQI CLDRYLAFGC

351 EGQAGKIKPV SLEKMASRYA KGELNQIVK*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 123>:

m034.seq (partial) 



1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC

151 AACAACCTCG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA

201 CGCCCCCGTC ATCGTACAGG CGAGTGCCGG TGCGCGCAAA TATGCGGGTG

251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCGAAGT ATTTCCACAC

301 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA

351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA

401 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA
451 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA
501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGATGCAGGC GAAGAAGACG
551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC
601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCTAT
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG
701 GCGATGTATT ACGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC
751 AATACACACA TCGTGATGCA C...



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pep (partial)

1 MSCLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAAE XSYGLPAFNV

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL ILAAVEVFPH

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT

151 RTVVNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP

251 NTHIVMH...
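
Some bases in m034.seq are given as ambiguity codes (w, y), and the corresponding residues in m034.pep appear as X only where the ambiguity changes the encoded amino acid. The following is a minimal sketch of such a translation, assuming the standard genetic code and the IUPAC nucleotide codes. How the original translation was performed is not stated in the text, and all names below are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# IUPAC nucleotide codes mapped to the bases they may stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate(dna):
    """Translate DNA in frame 1; ambiguous codons give 'X' unless every
    possible reading encodes the same amino acid."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        try:
            options = {
                CODON_TABLE[x + y + z]
                for x, y, z in product(*(IUPAC[base] for base in codon))
            }
        except KeyError:
            options = set()  # unknown symbol such as '.'
        protein.append(options.pop() if len(options) == 1 else "X")
    return "".join(protein)


# Bases 118-138 of m034.seq, which encode residues 40-46 of m034.pep.
print(translate("GAA wAC AGC TAC GGC yTG CCG"))  # EXSYGLP
```

In this fragment `wAC` can read as AAC (N) or TAC (Y) and is therefore reported as X, whereas `yTG` reads as CTG or TTG, both leucine, and is reported as L.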



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>:

a034.seq

1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG

251 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA

401 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC

451 CGTACCGTGG TTAATTTCTC CCACGCCTGC GGCGTATCCG TTGAAGGCGA

501 AATCGGCGTA TTGGGCAACC TCGAAACTGG CGAAGCCGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT CACCAGCGTC 

601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 

651 TGCCGTCGGC ACCAGCCACG GCGCGTACAA ATTCACCCGT CCGCCCACAG 

701 GCGACGTGTT GCGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC

751 AATACACACA TCGTGATGCA CGGCTCCAGC TCCGTTCCGC AAGAATGGCT

801 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG TAAAGTCAAC 

901 ATCGATACCG ACTTGCGCCT TGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TCGATCCGCG CAAATATTTG AGCAAAACCA

1001 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GCTACCTCGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCGGTT TCCTTGGAAA AAATGGCAAA 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 126; ORF 034.a>: 

a034.pep 

1 MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHAAE NSYGLPAFNV

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTVVNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 

m034/a034 96.9% identity over a 257 aa overlap 

10 20 30 40 50 fin 

m034.pep MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 
11 NMCIHMIIIMMIMIIMMillMIIMI IIIMIMMfl II (111 
MSRLWFFAAKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNWNLEQM^IM 
10 20 30 40 50 60 

70 80 90 100 HO 120 

EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPVVMHQDHGASPDVCORSI 

I I I N I : I I I I I M M 1 I II I I j | M I I It I I I I I I I II I I I I I II I M I I I I I 

E AADQVN APVI VQAS AGARKYAGAPFLRHL I LAAVEE FPHI P WMHQDHGAS PDVCORS I 
70 80 90 100 HO 120 

130 140 150 160 170 180 

QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGDAG 
M 1 M 1 1 1 1 1 ' i ' I I I M f I I I I 11 II I M I I M I [ M I I [| I II I I I i I I I I M M • t I 
QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTVWFSHACGVSVEGEIGVLGNLETGEAG 
130 140 150 160 170 180 

190 200 210 220 230 240 

m034.pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRTn 
M I 1 1 I M I I I I I I I II M I I I I I I I I I I I M I I M I M I I I I fj I I I I I I I I I I II I I 
EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
190 2 00 210 220 230 240 



a034 



m03 4 .pep 
a034 

m034 . pep 
a034 



a034 



250 

m034 .pep RIKEIHQALPNTHIVMH 
M II I I I I I M | | [ | M 

a034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN

250 260 270 280 290 300 

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng)
from N. gonorrhoeae:
m034/g034

m034.pep

MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM

II IN II M 1 II 1 II ! IMMIIII 1 1 1 1 MINI MM MIMIIIMM IIIMI 

MS RLW F FAVKN 1 1 IRLI YLLPKETQMALVSMRQLLDHAAENSYGIiPAFNVNNLEQMRAIM 


60 
60 


m03 4 


. pep 


EAADQVDAPVI VQASAGARKYAGAPFLRHIjILAAVEVFPHI P WMHQDHGAS PDVCQRS I 

1 III M M M 1 1 1 1 M 1 MIM Mill M 1 i MM 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 II M M 1 

EAADQVNAPVI VQASAGARKYAGAPFLRHLI LAAVEEFPHI PWMHQDHGAS PDVCQRS I 


120 






120 


m034 


.pep 


QLGFSSVM^GSLMEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGDAG 

I 1 1 ! 1 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 M 11 i M 1 M 1 1 1 1 1 1 1 1 t 1 1 M 1 1 1 . 1 1 
M 1 I 1 II 1 1 1 II 1 • 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 II 1 1! If 1 f II 1 1 1 1 1 1 1 1 M 1 M 1 1 : 1 1 

QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGEAG 


180 


g034 




180 


m034 


.pep 


EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

llllllhllMIIIIMIIIIII IIIIIIIIIIMIIIMIIIIIMMIIIMIIII 

EEDGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 


240 


g034 




240 


m034 , 


.pep 


RIKEIHQALPNTHIVMH 

i t i 1 1 1 1 i 1 1 1 1 1 1 1 1 ) 


257 


g034 




RI KEIHQALPNTHI VMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEI VEGI KHGVRKVN 


300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036.seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

451 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA

751 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036.pep 

1 MLKPCLVYSA CAAALPARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

251 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>:

m036.seq

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg.CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG

401 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG

451 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

751 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036 .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 131>:

a036.seq

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC

51 ACGGACTTCG AGCAGCAGGC

101 AATATTCGAG CAGGGCGGAC

151 GTGGCAATCA GGTGCAGTTC

201 AAAGGCGGCA ATCCCGCCGT

251 GCGAAACCAG TGCGGACTCA

301 CAGACGGCAT CGAGCGCGGC

351 GCGGATGTTC ACAGGCGCGC CCTCCGTTCC GCCTGTTCTT TGGCAGTCAA

401 GGCGATTTTG TTGCGGACGT

451 CGGGAAAACC GCCTTCAGCC

501 ATATCGGGTT TGCCTGAGAA

551 GCCGATGCCG TCTGAAAAGG

601 CCCGACCGAC CTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC

651 CATGCATAAA ACACTTCGCC

701 GCTTTGCGGC GGCGGCAGCG

751 TTAAAGGAGT ATCAAACGGC

801 ATACGCAGTC CCGTAA



This corresponds to the amino acid sequence <SEQ ID 132; ORF 036.a>: 

a036.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 

10 20 30 40 50 60 

m036.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS

1 1 11 11 11 1 1 ' ' I I f I I M M I I I I I I I I I | | | M M I I M I H | M I I I I I ! M I I I I t 
a036 MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS

10 20 30 40 50 60 

70 80 90 100 110 120 

m036 . pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
M U M I I I I I I f I I I I i I I I I I I I | | | | M I I I I i I 1 M | I I I II I :: I I I | | | || I 
a036 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 

70 80 90 100 110 120 
130 140 150 160 170 180

m036 . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
Ml II I I 1 I I I I I I I I I I I III : I : : I I I I I I : I I I MM I : Ml 
a036 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036.pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
MM M I M M I I II I I II M I M I M I M I I II I I I I I M II M II I I I M If 
a036 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 



250 260 270 

m036.pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX

I I M M I II M : I M I I I I- M II I I I 
a036 RRRHRARVRRLKEYQTALPNLAPRRCRYAVPX

250 260 270 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

m036.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS

MEM II I II I I M II I II I I II II MM: I M I I II I I M I I I I II M II II II 
g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS

10 20 30 40 50 60 

70 80 90 100 110 120 

m036.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF

I I M M I M M I I II I I I II II I M M M M I II M II I I M I I II : M I I Ml 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m036.pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA
: II I I i I I M I! I I I I M I II : MM II M I M II II M I : M : 

g036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV

130 140 150 160 170 180 

190 200 210 220 230 240 

m036.pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA
I I : I M I II I I :: I I I |. i || I II I I M I II I I I I II I II I : II II I M I 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA

190 200 210 220 230 240 



250 260 270 

m036.pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX

II II II II:: Mill) M M M II I 
g036 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX

250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 133>: 

m036-1.seq

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG

401 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG

451 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAG 

This corresponds to the amino acid sequence <SEQ ID 134; ORF 036-1>:

m036-1.pep

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA
51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 
101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 
151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 

m036-1/g036 76.8% identity in 228 aa overlap






10 20 30 40 so go 

m036-1.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS
Hilt I I N I I I : I II I I I I j | | | | mi: IMIIIM i MMMMIMMIII 

g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS
10 20 30 40 50 60 

70 80 90 100 110 120

m036-1.pep GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF
'MINIMI! MIIMIIMMIIIIIIIMMIIMIII:MMI::|| | Ml 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120

130 140 150 160 170 180 

m036-1.pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA
= H Ml MINIMI I II III : MM II M I : II I I I I I I • M * 

g036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 229 

m036-1.pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX
MM I I I II I I : : II I I I II I M II M i II I I II I If II 

g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA
190 200 210 220 230 240



g036 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 135>:

g038.seq

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 



401 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC

451 [illegible] AAAAGGCACG GGTAAATTGT CCGCCGTTCA

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC

601 GAACCCGTCC GCACCTACCG CCGGCAGTAC GGCGTAGAAT AA
This corresponds to the amino acid sequence <SEQ ID 136; ORF 038.ng>:

g038.pep

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE

101 AKDRGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA

151 IALDRMEKGT GKLSAVQEVE KQYGLPVAPI ASLNDLFILL QNNPEFGQFL

201 EPVRTYRRQY GVE*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 137>: 

m038 . seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC

451 ATCGCGCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>:

m038.pep

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA

151 IALDRMEKGT GELSAVQEVE KQYGLPVAPI ASLNDLFILL QNNPEFGQFL

201 EPVRAYRRQY GVE*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 139>: 

a038 . seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>:

a038.pep

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA

151 IALDRMEKGT GELSAVQEVE KQYGLPVAPI ASLNDLFILL QNNPEFGQFL

201 EPVRAYRRQY GVE*

m038/a038 100.0% identity over a 213 aa overlap

                    10        20        30        40        50        60
m038.pep    MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a038        MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m038.pep    GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a038        GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m038.pep    IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a038        IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI
                   130       140       150       160       170       180

                   190       200       210
m038.pep    ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX
            ||||||||||||||||||||||||||||||||||
a038        ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX
                   190       200       210



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 038 shows 98.1% identity over a 213 aa overlap with a predicted ORF (ORF 038.ng)
from N. gonorrhoeae:

m038/g038

                    10        20        30        40        50        60
m038.pep    MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES
            ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
g038        MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m038.pep    GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL
            |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
g038        GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDRGEGGVLVGAPLKGRVL
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m038.pep    IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI
            |||||||||||||||||||||||||||||||||||||||||:||||||||||||||||||
g038        IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI
                   130       140       150       160       170       180

                   190       200       210
m038.pep    ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX
            ||||||||||||||||||||||||:|||||||||
g038        ASLNDLFILLQNNPEFGQFLEPVRTYRRQYGVEX
                   190       200       210
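
The identity figures quoted for each comparison are the number of identical aligned positions divided by the length of the overlap. A minimal sketch of that arithmetic follows, using only the first 60 aligned positions of m038 and g038. The function name `percent_identity` is hypothetical, and the sketch assumes two pre-aligned strings of equal length in which `-` marks a gap and gap columns count toward the overlap length.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Assumption: the inputs are already aligned and equal in length; "-" marks
    a gap, and gap columns count toward the overlap but never as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


# Positions 1-60 of the m038/g038 comparison; only position 44 differs (L vs A).
m038_1_60 = "MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES"
g038_1_60 = "MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES"
print(round(percent_identity(m038_1_60, g038_1_60), 1))  # 98.3
```

By the same arithmetic, 98.1% identity over a 213 aa overlap corresponds to about 209 identical positions.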



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141>:

g039.seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCATCAAAC CGACACACAC 

51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 
251 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac

301 gaaatcgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC

351 CGCAACCGCC GCTGCCGCac ctGCCGCCGC ACCGCaggTT TCCGTACCGC

401 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT

451 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA



This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g039.pep

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIATLFAL

151 IVLIMQLSYL FIL*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 143>: 



m039.seq

1 ATGCCGTCCG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CGnnnnnnnn nnnnnnnnnn

151 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnCCC GAGGCTGTTT

251 CGGATGTCAA ACTCGTTCAC CGTATCGGCA CGCGCGCCAT CGGCAAGAAA

301 CAGATTTCCC GTGACGAAAT CGCCGGCATC CTCAACGGCG GTACAACCCA

351 GCCCGATATT CCGCCCGCAA CCGCCGCCAC CCCTGCTGCC GCACCGCAGG

401 TTACCGTACC GCCCGCCGCG CCCGCCCGTC AGGATGGGTT CAACTGGACG

451 ATTGCAACCC TGTTTGCCCT TATCGTCCTC ATTATGCAGC TTTCCTACCT

501 CGTCATCCTA TGA


This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m039.pep

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT

151 IATLFALIVL IMQLSYLVIL*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 145>: 

a039.seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAATGCGA AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAGCACGAAA 

201 GAACCCATAT TCAACGATT. TGCCCGAAGC TGTTTCGGAT GTCAAACTCG

251 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

401 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT

451 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039.a>: 

a039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF

151 ALIVLIMQLS YLVIL*



m039/a039 79.4% identity over a 170 aa overlap

                    10        20        30        40        50        60
m039.pep    MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX
            ||||||||||||||||||||||||||||||||||||||||||||
a039        MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m039.pep    XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI
                         |     ||||||||||||||| |||||||||||||||||||||||||
a039        TIWQARKNPYSTIX-----PEAVSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI
                    70             80        90       100       110

                   130       140       150       160       170
m039.pep    PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX
            |||||||||||||||||||||||||||||||||||||||||||||||||||
a039        PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX
              120       130       140       150       160

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039.ng)
from N. gonorrhoeae:

m039/g039

                    10        20        30        40        50        60
m039.pep    MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX
            |||||| |||||||   |:  |||||: ||||:||||||:| ||
g039        MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSPKASSSAKNAKGCLKPK
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m039.pep    XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI
                               |:   ||||||||||:||:|||:|||||| |||||||  |
g039        TIWQARKNLYSTIG-----PKLFRDVKLVHRIGTHAISKKQMSRDEIADILNGGTTLHDT
                    70             80        90       100       110

                    130       140       150       160       170
m039.pep    PPATAAT-PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX
            ||||||: |||||||:||||   ||:|:|||||||||||||||||||| |||
g039        PPATAAAAPAAAPQVSVPPA---RQEGLNWTIATLFALIVLIMQLSYLFILX
              120       130          140       150       160

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 147>:

g040.seq

1 ATGAACGCGC CCGACAGCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA

51 CATCCGCCAA ATGCGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC

101 TGCTCGAAGG CGGCACCTTA AATAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

^ o^ gCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

401 GTCcgatggg cgtgattgac ggaACCGata tggaatacgc gggggttatc 

451 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

™ ^GCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCAct^C 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGcScCGC^G 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTcSSc^G 
1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 CtACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040.pep

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NKLAADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKTFAEAD

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

401 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>:

m040.seq

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

401 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC

451 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

751 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G.CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG

851 AAGAACAGGG CATCCTGCTG CACCGCAs.c GCGAATACCT CGAAAACCAC

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag.cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC

1051 CACATTATCG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m040.pep

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NKLAADIGLL

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

401 RNSHILVRRL HR*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 151>: 

a040.seq 
1 ATGATCGTGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA
51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 
101 TGCTCGAAGG TGATACCTTA AACAAGTTCG CCGCCGACAT CGGGCTTTTG 
151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 
201 CGACCGCCAC GCCGCCGCGC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

401 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC

451 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCCATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGTA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGCGA AACGCGACGG CTGATTTCGT CCGCCGTTGC CGCGCTCGAA
751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG
851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATT 
901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCATCCTGC TGCACCGCAG 
951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 

1001 ACGGCAACCT GTACGGTTGC GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 

1101 CTACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGGAACTC GCATATTCTG GTGCGTCGCC

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 152; ORF 040.a>: 

a040.pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKFAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

401 ERGFQTASED ELPETRRKDY RSNGRNSHIL VRRLHR*

m040/a040 91.5% identity in 436 aa overlap

                    10        20        30        40        50        60
m040.pep    MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI
            | :|||||||||||:|||||||||||||||||||||||||||:|||||||||||||||||
a040        MIVPDLFVAHFREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m040.pep    HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a040        HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m040.pep    PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a040        PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m040.pep    TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR
            ||:|||||||||:|||||||||||||||||||||||||| ||||||||||||||||:|||
a040        TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR
                   190       200       210       220       230       240

                                           250       260       270
m040.pep    LISSA------------------------ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI
            |||||                        ||||||||||||||||||||||||  |||||
a040        LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI
                   250       260       270       280       290       300

             280       290       300       310       320       330
m040.pep    AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS
            |||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
a040        AALIRPLEEQGILLHRSREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS
                   310       320       330       340       350       360

             340       350       360       370       380       390
m040.pep    PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY
            || |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
a040        PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY
                   370       380       390       400       410       420

             400       410
m040.pep    RSNGRNSHILVRRLHRX
            |||||||||||||||||
a040        RSNGRNSHILVRRLHRX
                   430

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng)
from N. gonorrhoeae:

m040/g040

m040.pep    MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI  60
            |:||| ||||||||:|||||||| ||||||| ||||| ||||||||||||||||||||||
g040        MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI  60

m040.pep    HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA  120
            ||| ||||| |||||||||||||||||||||| |||||||||||||||||||||||||||
g040        HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA  120

m040.pep    PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK  180
            |||||||||||||||:|||||||||||||||||||||||||||||||||:|||||||:||
g040        PSVPLVSGNFLTARPMGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK  180

m040.pep    TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR  240
            || |||:|:|||:|||||||||||||||||||||||||||||||||||||||||:::|||
g040        TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR  240

m040.pep    LISSA------------------------ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI  276
            |||||                        ||||||||||||||||||||||||  |||||
g040        LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI  300

m040.pep    AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS  336
            |||||||||||:|||| ||||||||||||||||||:||||||||||||||||||||||||
g040        AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGEIACLAVS  360

m040.pep    PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY  396
            || |||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
g040        PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY  420

m040.pep    RSNGRNSHILVRRLHRX  413
            |||||| ||||||||||
g040        RSNGRNPHILVRRLHRX  437



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 153>:

g041.seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGCC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA

401 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA

451 GAATTTTTGG GATAA



This corresponds to the amino acid sequence <SEQ ID 154; ORF 041.ng>:

g041.pep

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA

51 GSSWTDEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK

151 EFLG*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 155>:

m041.seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTGTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

401 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA

451 GAGTTTTTGG GCTAA



This corresponds to the amino acid sequence <SEQ ID 156; ORF 041>:

m041.pep

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA

51 GSSWTDEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK

151 EFLG*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>:

a041.seq

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

401 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA

451 GAGTTTTTGG GCTAA AAA

This corresponds to the amino acid sequence <SEQ ID 158; ORF 041.a>:

a041.pep

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA

51 GSSWTDEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK

151 EFLG*

m041/a041 98.7% identity over a 154 aa overlap

                    10        20        30        40        50        60
m041.pep    ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a041        ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m041.pep    PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
a041        PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW
                    70        80        90       100       110       120

                   130       140       150
m041.pep    LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX
            |||||||||||||||||:|||||||||||||||||
a041        LYSPDGGGHTGNGTQREAADELACVLLFLKEFLGX
                   130       140       150

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041.ng) 
from N. gonorrhoeae: 

m041/g041 

                    10        20        30        40        50        60
m041.pep    ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN
            :|||:|||||||||||||||||||||||||||||||||||||||||||||||||||||||
g041        MSSPKHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m041.pep    PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW
            |||||:|||||||||||||||||||||||||||||||||||||||||||||||||| |||
g041        PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW
                    70        80        90       100       110       120

                   130       140       150
m041.pep    LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX
            ||||||||||||||||||||:||||||||||||||
g041        LYSPDGGGHTGNGTQRESADKLACVLLFLKEFLGX
                   130       140       150



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 159>: 

g041-1.seq

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT

201 GTACCATTTC CATCAGAATG CGGAATATCC GAAGGGCGTG TACCGCATGT

251 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA

401 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG

451 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC

601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT

651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AGACGTATTT GCAGGTGTCG

751 TCCGAAGGCG GGGCGAAACC GTTGAACCTG CCTAATGATT GCGATGTGGT

801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC

851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC

1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CGACAGCAAA

1051 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC

1101 CGACCAACCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC

1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG

1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG

1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG

1451 GAGAATTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGTGCGCTG

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC

1701 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT

1751 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC

1801 ATCGATTATC CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA

2001 AGAATTTTTG GGATAA



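This corresponds to the amino acid sequence <SEQ ID 160; ORF 041-1.ng>:

g041-1.pep

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YRMCTAATYR SGYPEWKILF

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG

201 KSFEESLPAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS

251 SEGGAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADSK

351 WQEAELPHLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT

401 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLVYAYG

451 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH

501 KSVDDLLAVV RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG

651 TQRESADKLA CVLLFLKEFL G*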



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>:

m041-1.seq

1 ATGAAATCCT [missing] CTACCGCCAT TTTGAAAACC TCGATTCCGC

51 CGAAACGCAA [missing] CTGAAGCGAA TGCCGAAACG CGCGCGCGTT

101 TTTTAGAAAA [missing] CGCGCGCTTT CAGACGGCAT TTTGGCGCAG

151 TTGCAGGACA [missing] TCCGTTTTGT CAGGAACACC GCGCGCGGAT

201 GTACCATTTC [missing] CGGAGTATCC GAAGGGCGTG TACCGCGTGT

251 GTACCGCGGC [missing] TCCGGCTATC CCGAGTGGAA AATCCTGTTT

301 TCGGTGGCGG [missing] ATTGCTTGGC GACGATGTGT ATTTGGGCGG

351 CGTGTCGCAC [missing] AGCCCAACCG CGCGTTGTTA ACACTGAGCA

401 AATTGGGCAG [missing] TACACGCTGG AAGTGGATTT GGAAGCAGGG

451 GAGTTGGTCG [missing] TCACTTTCCG GCAGGCAAAA ACCATGTGTC

501 GTGGCGCGAT [missing] TGTGGGTGTG TCCGGCTTGG AACGAACGCC

551 AGTTGACCCA [missing] CCGCGCGAAG TATGGCTGGT GGAACGCGGC

601 AAGAGTTTCG [missing] GCCTGTGTAT CAAATCGGCG AAGACGGCAT

651 GATGGTGAAC [missing] ATCTCGATCC GCAGGGTTCG CCGATTGATT

701 TGATTGAAGC [missing] TTTTACACCA AAACCTATTT GCGGGTCTCA
751 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT

801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

851 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG

1451 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 

2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 162; ORF 041-1>: 

m041-1.pep

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK

351 WQEVELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG

651 TQRESADELA CVLLFLKEFL G*

m041-1/g041-1 94.6% identity in 671 aa overlap

10 20 30 40 50 60 

m041-l .pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 
M I I II I I I I I I M f I It I I I I t I I I I I I I I I I I I : I M I I I I I I I I I I : I S I I I II 1 I 
g041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 1-1 . pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I I I I I M I II I I I I I ! I I I M I I I I I 
g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 1-1 . pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I I I I I M I I I II : I I : I I I I I I II i I I I I I 1 I I I I I I ! I I I I I I I I I I I I I I I | M I I I 
g041-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m041-l .pep NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PIDLIEASDG 
: I I I M : I I I I I I I I I I I I I I I I I I I I I : I I I : : I I I I I i I i I I I I I I I I II I I I I I I 
g041-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 
M I II I I : I I : I I I M I I I 11 i j i | M I M M M ! I I M I I : I j I f I I i I! [ I I I 1 I I I 
FYTKTYLQVSSEGGAKPLNLPNDCDWGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
250 260 270 280 290 300

310 320 330 340 350 360

RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
Mf IH'M|[NIIIMllMM|||M|||j||||MIMItMM:ltM:MI-M 
RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADSKWQEAELPHLP 
310 320 330 340 350 360

370 380 390 400 410 420 

SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVOOFW 
' I M i I M I I M I M M I I I 1 M I M M I II I M I I I I I | | | | Mill llll-l-ll | 

SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 
370 380 390 400 410 420 

430 440 450 460 470 480

m041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 

AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNA^LANI 
430 440 450 460 470 480

490 500 510 520 530 540

m04 1-1 . pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
nyl1 , 1 1 1 1 1 * I I I ) M I II I II I I I M I | | M II I I M I II : I i I : I I M I M I I I | M II I M 

g041-l RGGGEFGPRWHQAAQGISKHKSVDDLI^WRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540

550 560 570 580 590 600 

m041-l.pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
11 1 I 1 1 1 i 1 I I I I I I I M II I M I | M M I I I I M I I I 1 M I : I I I I I I M I I I I I M I I 
VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 
550 560 570 580 590 600 

610 620 630 640 650 660 

IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

nai n I I II II I I I IIMMIIIIIMI IIIIIMIMMIIIIMIIMI-M 

g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRE SADKLA 

610 620 630 640 650 660 

670 

m041-l .pep CVLLFLKEFLGX 
I M II I II I I I I 
g 0 4 1 - 1 CVLL FLKE FLGX 

670 
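
The identity figure quoted for each pairwise comparison can be checked directly from the aligned residues. The following is a minimal sketch, not the program used to produce the alignments above: it assumes a gap-free alignment in which both strings have the same length, and the function name `percent_identity` is hypothetical. The two strings are residues 1-60 of m041-1 and g041-1 as aligned above.

```python
def percent_identity(seq_a: str, seq_b: str) -> tuple[int, float]:
    """Return (identical positions, percent identity) for two aligned strings.

    Assumes the strings are already aligned and of equal, non-zero length.
    A gap character ('-') never counts as a match.
    """
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    return matches, 100.0 * matches / len(seq_a)


# Residues 1-60 of the m041-1/g041-1 alignment.
M041_1 = "MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC"
G041_1 = "MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC"

matches, pct = percent_identity(M041_1, G041_1)
print(f"{matches}/{len(M041_1)} identical ({pct:.1f}%)")
```

Applied to the full 671-residue overlap, the same count would be expected to give the figure quoted in the alignment header, provided the overlap contains no gaps.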

m041-1/P55577

PEPTIDASE Y4NA >gi|2182536 (AE000086) Y4nA [Rhizobium sp. NGR234] Length = 726

Score = 370 bits (940), Expect = e-101

Identities = 217/682 (31%), Positives = 331/682 (47%), Gaps = 22/682 (3%)

Query: 2 KSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 61 

K DP + +D + + N T + ++ + L LQ T +1 

Sbjct: 42 KDASDPRAYLNEIDGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTILQATDRIASPS 101

Query: 62 EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 120

R M + F QD + +G++R T +YRSG P+W+ + V + G G 
Sbjct: 102 FARDGMIDNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 161

Query: 121 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 180
N L+ LS G D E D+ GE V+ GF P GK V+W DEN+++V W 



Sbjct: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDENTIYVTREW 221

Query: 181 NERQLTQSGYPREVWLVERGKSFEESLPVYQ IGEDGMM — VNAWRYLDPQGSPI 232 

++T SGY +V+RG+S ++++ +++ E G++ ++ +D + 

Sbjct: 222 TPGEVTSSGYAYVTKVVKRGQSLDQAVEIFRGQKKDVSAERGVLRDIDGKYVMDTSYRGL 281 

Query: 233 DLIEASDGFYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQS-YPS 291 

D FY + + L LP GY G + L+ DW A + + + 

Sbjct: 282 DFFNTELAFYPNGH PDTRKVVLPLPTTAVFSGYYKGQAI YWLKSDWTSAKGTVFHN 337 

Query: 292 GALVAVKLNRGELGAAQL LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 347 

GA++A L A++ LF P+E Q++ TK +V S+L NV ++++ F 

Sbjct: 338 GAIIAFDLKAALADPARVEPLVLFMPNEHQSVAGTTQTKNRLVLSILSNVTSEVRSFDFG 397 

Query: 348 DGKWQEVELPRLPSGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQ 407 

G W +L + L +T D +++ + F P TLF D ++ + P 

Sbjct: 398 KGGWSSFKLALPENSTLSLTSSDDESDQLFVFSEGFLEPSTLFCADAATGQVEKITSTPA 457 

Query: 408 QFDSDGINVQQFWTTSADGERIPYFHVGKNAAP DMPTLVYAYGGFGI PEL PHYLGS I 4 64 

+FD+ G+ QQFW TS DG ++PYF V + PT++YAYGGF IP P Y + 

Sbjct: 458 RFDAGGLQAQQFWATSKDGTKVPYFLVARKDVKLDGTNPTILYAYGGFQIPMQPSYSAVL 517 

Query: 465 GKYWLEEGNAFVLANIRGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHI 524

GK WLE+G A+ LANIRGGGEFGP+WH A ++ + DD AV +DL + ++S H+ 
Sbjct: 518 GKLWLEKGGAYALANIRGGGEFGPKWHDAGLKTNRQRVYDDFQAVAQDLIAKKVTSTPHL 577 

Query: 525 GLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVC 584

G+ GGSNGGL+ ++ P A+V +VPL DM+ + +SAG+SW EYG+P V 

Sbjct: 578 GIMGGSNGGLLMGVQMIQRPDLWNAVVIQVPLLDMVNFTRMSAGASWQAEYGSPDD-PVE 636 

Query: 585 KRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGG 644

L +SPYHN+ G+ YP TS DDRV PHAKA + +Y G 

Sbjct: 637 GAFLRSISPYHNVKAGVAYPEPFFETSTKDDRVGPVHARKMAALFEDMGLPFYYYENIEG 696 

Query: 645 GHTGNGTQRESADELACVLLFL 666

GH +E A A +++ 

Sbjct: 697 GHAAAANLQEHARRYALEYIYM 718 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 163>: 

a041-1.seq

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT CTGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAAGATG CGGAATATCC GAAAGGCGTG TACCGCGTGT

251 GTACCGCGGC GACTTACCGT TCGGGCTATC CTGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT GACGATGTAT ATCTAGGCGG 

351 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

401 AATCGGGCGG CGATACCGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG

451 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG TGTGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC CAAATTGCTG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCTGACGGT TTTTACACCA AAACCTATTT GCAGGTCTCA

751 GCCGAAGGCG AAGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTAGT

801 CGGCTATCTG GCCGGACATC TTTTGCTGAC CTTGCGTAAA GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT TGGTAGCAGT AAAATTAAAC 

901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG TTTGCGCCCA ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTCGTG GCGAGCCTGC 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT GGCGTTTTAC TGATGGCAAA 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAAGCG TGGGGGGGCG ACGTAGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
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14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG AGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT
1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT
1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCGC 
1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 
1951 ACGCAGCGCG AAGCCGCCGA CGAACTCGCC TGCGTGTTGC TGTTTTTGAA 
2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-1.a>:

a041-1.pep

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFVV ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 

a041-1/m041-1 97.9% identity in 671 aa overlap






10 20 30 40 50 60 

a041-1.pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC
11(111 MliMillMIUMIIMI If 111111:111 III I || MJM III IMMN 
m041-1 MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC
10 20 30 40 50 60 

70 80 90 100 110 120

a041-1.pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH

nyl1 , IMIIMIMIIIII Mill MINIMI III III I II MM MM Mill II 

m041-1 QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH

70 80 90 100 110 120



130 140 150 160 170 180

a041-l.pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
n*, , I II II I III I I II I l-IMIIMIIIMMIIMIIIIIMIININIMIIIIIIM 

m041-1 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW

130 140 150 160 170 180






190 200 210 220 230 240

a041-1.pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGSPIDLIEASDG

! N I I I : I I I I I I I I I I I M I I I I I ; I f| I I I I M I I M II I I 1 M 1 I I I I I I 

m041-1 NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGSPIDLIEASDG

190 200 210 220 230 240 

250 260 270 280 290 300

a041-1.pep FYTKTYLQVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN

1 1 1 1 1 1 I : I I I I i I I I i I I I I I I I I I I I I I I I M | | | | | | 1 | : | | || || || . i i I n | | , 
m041-1 FYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN

250 260 270 280 290 300 

310 320 330 340 350 360

a041-1.pep RGELGAAQLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP

1 1 1 1 1 1 1 1 1 M 1 1 : 11 1 1 1 ' 'I I I M II I 1 I I I I I I M I M I M II -I I I I i I -I I M I i 
m041-1 RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP
310 320 330 340 350 360
370 380 390 400 410 420

a041-1.pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW
t I I I M t I I I I I I I I I I I I I I I I I I I M I I I I I I 1 I I I I I I I I I I I I | I | | | | | | | | | | | 
m041-1 SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW
370 380 390 400 410 420 

430 440 450 460 470 480 

TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
M I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I II I I M I II I I I I I I I I || I 
TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI

430 440 450 460 470 480 

490 500 510 520 530 540 

RGGGEFGPRWHQAAQGISKHKSVDDLLAVVSDLSERGISSPEHIGLQGGSNGGLITAAAF
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I II I I | i | | | | | | | M | | 
RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF

490 500 510 520 530 540 

550 560 570 580 590 600 

VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I I I I I I II I I I I I I I II I I I I I I I j I I I I I M I I I I I I I I I E I I I I I M | | | | | | M I I 
VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
550 560 570 580 590 600 

610 620 630 640 650 660 

IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQREAADELA 
If I II I I I I I I I II I I I I I I I I I I i I I II ! I I I I II I I M I I I I I I I || | I | | : | | | | | 
IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

610 620 630 640 650 660 

670 

CVLLFLKEFLGX 
I II I I I I I I I 11 
CVLLFLKEFLGX 
670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: 

g042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC
51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG
101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG
151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG
201 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC
251 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT
301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC
351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT
401 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAAA
451 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG
501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT
551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC
601 AAATAG

This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>:

g042.pep

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP
101 LPLAASRFWA NSASICAFNS ATRASLPKIR DRVSICFSPL VRILPLSTVK
151 SMVVAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC
201 K*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 167>:

m042.seq

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC
51 GTTATCCAmT ACTTCGACAG CCGcCGGCCy TTCyTGCCCG ATGGCGGCGG
101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG
151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG
201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC
251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT
301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC
351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT
401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA
451 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG
501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT
551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC
601 AAATAA



This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR

151 SMVVAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>:

a042.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC
251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA
451 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG
501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 
551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 
601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 170; ORF 042.a>: 

a042.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR

151 SMVVAFFANC SYASAPGPPV MTS*GL*RCR AS*SGSNSVP TVAALSNAGC

201 K*



m042/a042 99.0% identity over a 201 aa overlap 

10 20 30 40 50 60 

m04 2.pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I I f I 1 I I I I I I | | | [ | | I M I I I I I M I I I I i I I I | M I I I I I I I I M I I 
a042 MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
M I I I I I I I I II I! I I I | | | | | | | j | M I I I I I | | j | | 1 | | M I I 1 I I I! I | | | | M | | | 
a042 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m042 . pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
I I I I I I I i I I t I I I I I I I I I I M I I I I I I I I I I I I I I I I I I | I I | | M | | | | | | | | | | | |
a04 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I M I I I II I I I I I I I I 
a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N. gonorrhoeae: 

m042/g042 

10 20 30 40 50 60 

m042.pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I M I I I I I I : I I I I I I I I I I I I I I I I I II j I I I I | I I I I I I I I I I I I I | | | I | 
g04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS

I I I I I I t I I I II I I I I I I I I I J I I i I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I II 
g042 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS

70 80 90 100 110 120 

130 140 150 160 170 180 

m042.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR
1:11111111 : M I I I I t I I I I I ! I I I I : I I I I I I ! I I I I I I I I I I I I M f : II III 
g042 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR

130 140 150 160 170 180 

190 200 
m04 2 .pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I I I I I I I I I I I I I 
g042 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 171>: 

m042-1.seq

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC
51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG
101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG
151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG
201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC
251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT
301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC
351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT
401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA
451 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG
501 CCCGCCGGTA A

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-1>:

m042-1.pep

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR
151 SMVVAFFANC SYASAPGPPV MTS*

m042-1/g042 95.4% identity in 173 aa overlap

10 20 30 40 50 60

m04 2-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
N I I I I I I I I I I I : I I I I I I I I I I I I I I I | | | | | | | | | | | | | | m I I M I I I I I I I M | 
g042 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL

10 20 30 40 50 60 

70 80 90 100 110 120

m042-l . pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

M I I I M I I I I I I I I I I I I I I I I I I | i | | | | | | f| | | | | ,, | | [ I I I I M I I I 

g042 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS

70 80 90 100 110 120 

130 140 150 160 170 

m042-1.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX

I = M I I I I I I • M I 1 I I I I I I I I I f | ! | : | | | M 1 M I M I I I i I M I M I : 
g042 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR

130 140 150 160 170 180 

g042 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>:

a042-1.seq

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA

451 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 174; ORF 042-1.a>:

a042-1.pep

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR
151 SMVVAFFANC SYASAPGPPV MTS*

m042-1/a042-1 100.0% identity in 173 aa overlap

10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
A/10 , M I I ! I 1 I I I I I I i I | | | | | | | | M tl I I I I | | M i | | | | | M | | | | | | t | | j | ,,,,, | 

a042-l MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120

m04 2-l.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

n _ , I M M M M i I I I I I I i I | | f | | | | | | | | | j | II MINI MINI II II Ml II 

a042-1 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS

70 80 90 100 110 120 

130 140 150 160 170 

m042-1.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX
MNIMIIIIMIIIIIIIIIIilMiMMiiMiiii,,,,,,!,!,,!,! 

a042-1 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX

130 140 150 160 170

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 175>: 
g043.seq

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GTCGGCCCAT CAGCACTTTT
51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GCCTGAACGC TTCgttgaAC
101 CGTCCCGCGT ggcggtagcc gcAAAAGTGC ATcGCGGCTT GGATGGTGCT
151 GCCCGATTCG ATGAGGGcga gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC
201 GTCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG
251 CATTCGTTCA GCCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGGCG
301 GGCGAATTCG CTGTTCAAAA TATCGGCGGC TTCGTCTATG CGCCGGCGGC
351 GGTAGCCGTT GTCGTGGCGG CGGAAGGTGA AGCGTAA



This corresponds to the amino acid sequence <SEQ ID 176; ORF 043.ng>:

g043.pep

1 MVVSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA

101 GEFAVQNIGG FVYAPAAVAV VVAAEGEA*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

m043.seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m043.pep

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT

101 GEFVLQDVGG FVYAPTAVTV VVAAEGEAQ*

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF 043.ng)
from N. gonorrhoeae:

m043/g043 

10 20 30 40 50 60 

m04 3 . pep MWSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I I I I I I I I : I I I I I : I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I 
g04 3 MWSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 3 . pep QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I ! : : I : : M I I I I I : I I : I 
g04 3 Q PQAAQAS G DG FAGLR FE I AFQVAFVQ P DAAGD FG DGQRAGE FAVQN I GG FVYAPAAVAV 

70 80 90 100 110 120 



130 

mO 4 3 . pep VVAAEGEAQX 

I I I I I I I 1 
g04 3 VVAAEGEAXX 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 179>: 

a043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 
151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 
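
Each row of a sequence listing such as the one above begins with the 1-based position of its first residue, followed by up to five groups of ten residues. The sketch below shows one way to rejoin such rows into a single sequence and to check that every position label agrees with the residues read so far. It is an illustration only: the helper name `parse_listing` is hypothetical and is not part of the disclosure, and the two rows are the first 100 nucleotides of a043.seq.

```python
def parse_listing(rows: list[str]) -> str:
    """Join numbered listing rows into one sequence, checking each position label."""
    sequence = ""
    for row in rows:
        fields = row.split()
        if not fields:
            continue  # blank spacer line between rows
        position, groups = int(fields[0]), fields[1:]
        expected = len(sequence) + 1
        if position != expected:
            raise ValueError(f"row labelled {position}, expected {expected}")
        sequence += "".join(groups)
    return sequence


# First two rows of a043.seq, including the blank spacer line.
A043_ROWS = [
    "1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT",
    "",
    "51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC",
]

seq = parse_listing(A043_ROWS)
print(len(seq), seq[:10])
```

A row whose label does not match the running length (for example a label split by a stray space) raises `ValueError`, which makes transcription errors in the position column easy to spot.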

This corresponds to the amino acid sequence <SEQ ID 180; ORF 043.a>:

a043.pep

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT

101 GEFVLQDVGG FVYAPTAVTV VVAAEGEAQ*

m043/a043 100.0% identity in 129 aa overlap



mO 4 3 . pep 



10 20 30 40 50 60 

MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 



''''''''' I I ' I I I I I I I II I M I I I I I | | | | | | | | | | | | | | | j | | | | | U I II 

a043 MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERV^ 

10 20 30 40 50 60 

70 80 90 100 110 120

m04 3 . pep Q PQAAQASG DGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGE FVLQDVGG FVYAPTAVTV 

''''''' 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I I I | | | | | | M I I I I I I | | I I I I I I | 
Q PQAAQASG DGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGE FVLQDVGG FVYAPTAVTV 
70 80 90 100 110 120 



a043 



130 

mO 4 3 . pep WAAEGEAQX 
I I M I M i I I 
aO 4 3 WAAEGEAQX 

130 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 181>:

g044.seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG

251 CGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>:

g044.pep

1 MLPDQSVEFL PQVVVFDGLF GGGFPAVALP TVYPVFHAVF DVLRVGADDD
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK FFFAAAVAH*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 183>:

m044.seq

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 

This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>:

m044.pep

1 MPSDXSVEFF PEVVVFDGLF GGGFPAVALP TVYPVFHAIF DVLRVGADDD
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH*
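
The correspondence between a DNA listing and its amino acid sequence can be reproduced by translating the first reading frame codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (the text does not state which table or program was used), and the names `CODON_TABLE` and `translate` are hypothetical. Codons containing ambiguity codes are written as X, and the `stop_char` argument allows an in-frame stop to be written either as `*` or, as in the alignments, as X. The input is the first 30 nucleotides of m044.seq.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, stop_char: str = "*") -> str:
    """Translate frame 1 of a DNA string; whitespace and letter case are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # unknown or ambiguous codon -> X
        protein.append(stop_char if aa == "*" else aa)
    return "".join(protein)


# First 30 nucleotides of m044.seq; the fifth codon (TAG) is an in-frame stop.
M044_START = "ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT"
print(translate(M044_START, stop_char="X"))
```

With `stop_char="X"` the result matches the first ten residues of m044.pep, where the in-frame TAG is written as X.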
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>:

a044.seq

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a044.pep

1 VPSDQRVEFF PQVVVFDGLF GGGFPAVALP TVYPVFHAVF DVLRVGADDD
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAH*

m044/a044 91.0% identity over a 89 aa overlap 



10 20 30 40 50 60 

m04 4 . pep MPSDXSVEFFPEVVVFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
: I M I I I I I : I i M ! I I i I I I ! I I I 1 I M I M I I I I : I I 1 I 11 I I I I M M I M I i II 
a044 VPSDQRVEFFPQVVVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS

10 20 30 40 50 60 



70 80 90 

m044.pep FDDGSQFHAVVGGLRFAAEKFFFVATVAHX
M I I : I I I : I I I I I I I I I I I M I I t : II I I 
a044 FDDGGQFHTVVGGLRFAAEKFFFVAAVAHX

70 80 90 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

m044/g044 

10 20 30 40 50 60 

m044.pep MPSDXSVEFFPEVVVFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS

I I M I i : i : I i ! I I M ; i ! M J i i : I I i I I I I I i II i I I M I I ! i 

g044 MLPDQSVEFLPQVVVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP

10 20 30 40 50 60 

70 80 90 

m044.pep FDDGSQFHAVVGGLRFAAEKFFFVATVAHX
I h I : I : I II I I I II I I I 11 I I I : I : | | | | 
g044 FDNGGQLHAVVGGLRFAAEKFFFAAAVAHX

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 187>: 

g046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC gaATATGGAA AGGCTGCCGt TTTcGTTGTT

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TtcgctGGAA CGGACGCGCG

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG

301 ATGTTGGTTT CGTCGCTGCG GGagaGCGCG AGcagcaagt cggcatcttC

351 CgcgccggcG Cgttataatg tgAAGGGGGA TGCGccgttg ccgaAAACGG

401 TTTGGacatc gaggcggctg CCTGTTTCCT GCAATGCTTT TTCGTCGATG
451 TCGATAAcgg TTACGTCGTT GTTGGTGATG GCGGCAAGGT TTTGCGCGAC
501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG
551 TCGCCGGGTG A

This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 

g046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM

151 SITVTSLLVM AARFCATVEP TCPLPKMRIF TVWVAG*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 189>:

m046.seq

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAsT CCGC.r.sGC gCGcCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG
251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC
351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG
451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 
501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 
551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m046.pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 191>: 

a046.seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG

451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 

m046/a046 98.4% identity over a 186 aa overlap 

                    10        20        30        40        50        60
m046.pep    MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME
            ||||||||| |  |||||||||||||||||||||||||||||||||||||||||||||||
a046        MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m046.pep    RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a046        RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m046.pep    RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a046        RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF
                   130       140       150       160       170       180

m046.pep    TVWVAEX
            |||||||
a046        TVWVAEX



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 185 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 

m046/g046 

                    10        20        30        40        50        60
m046.pep    MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME
            ||||||||| |  |||||||||||||||||||||||||||||||||||||||||||||||
g046        MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m046.pep    RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g046        RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m046.pep    RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF
            | |||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
g046        RYNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRIF
                   130       140       150       160       170       180

m046.pep    TVWVAEX
            |||||
g046        TVWVAGX



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047.seq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGTCATACCc gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCTCGAACAC 

301 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCcgtg ccgaATGGAT 

351 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 
401 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

451 CTGACCAACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 

501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 
551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 
601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 
651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 
701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 
751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 
801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 
851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 
901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047.ng>: 

g047.pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK IDIVVSPHLI 

201 TIGSILAHIR RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 195>: 

m047.seq 

1 ATGGTCATCA TACAGgCGcG C..syGCGGA sTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CCAAAGAAAC CAGCcCmgmm 

251 GcATCATGAT TkCCGGCGGC GGCAACATCG GCTACCGTCT CGCCAAGCAG 

301 CTCGAACACG CATACAACGT yAAAATCATC GAATGCCGGC CGCGCCGTGC 

351 CGAATGGATA GCCGAAAACC TCGACAACAC CCTCGTCyTG CAAGGTTCGG 

401 CAACCGACGA AACCCTGCTC GACAACGAAT ACATCGACGA AATCGACGTA 

451 TTCTGCGCCC TGACCAACGA CGACGAAAGC AACATTATGT CCGCCCTTTT 

501 GGCGAaAAAC CTCGGCGCGA AGCGCGTCAT CGGCATCGTC AACCGCTCAA 

551 GCTACGTCGA TTTGCTCGAA GGCAACAAAA TCGACATCGT CGTCTCCCCC 

601 CACCTCATCA CCATCGGCTC GATACTCGCC CACATCCGGC GCGGCGACAT 

651 CGTTGCCGTC CACCCCATCC GGCGCGGCAC GGCGGAAGCC ATCGAAGTCG 

701 TCGCACACGG CGACAAAAAA ACTTCCGCCA TCATCGGCAG GCGCATCAGC 

751 GGCATCAAAT GGCCCGAAGG CTGCCACATT GCCGCCGTCG TCCGCGCCGG 

801 AACCGGCGAA ACCATTATGG GACACCATAC CGAAACCGTC ATCCAAGACG 

851 GCGACCACAT CATCTTTTTC GTCTCGCGCC GGCGCATCCT GAACGAACTG 

901 GAAAAACTCA TCCAGGTCAA AATGGGCTTT TTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047.pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKII ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRVIGIV NRSSYVDLLE GNKIDIVVSP 

201 HLITIGSILA HIRRGDIVAV HPIRRGTAEA IEVVAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAVVRAGTGE TIMGHHTETV IQDGDHIIFF VSRRRILNEL 

301 EKLIQVKMGF FG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 197>: 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 
251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

301 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 
351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

401 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 
451 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 
501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 
551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 
601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 
651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 
701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 
751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 
801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGCGACCACA 
851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 
901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 198; ORF 047.a>: 

a047.pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK IDIVVSPHLI 

201 TIGSILAHIR RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 

m047/a047 96.5% identity over a 312 aa overlap 

                    10        20        30        40        50        60
m047.pep    MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA
            |||||||  | |||||||||||||||||||||||||||||||||||||||||||||||||
a047        MVIIQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m047.pep    AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI
            |||||||||||||||||       ||| ||||||||||||||||||||||||||||||||
a047        AAENIGAVIPELRPKETSTRR---IMIAGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI
                    70        80           90       100       110

                   130       140       150       160       170       180
m047.pep    AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a047        AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV
            120       130       140       150       160       170

                   190       200       210       220       230       240
m047.pep    NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a047        NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK
            180       190       200       210       220       230

                   250       260       270       280       290       300
m047.pep    TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a047        TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL
            240       250       260       270       280       290

                   310
m047.pep    EKLIQVKMGFFGX
            |||||||||||||
a047        EKLIQVKMGFFGX
            300       310



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047.ng) 
from N. gonorrhoeae: 



m047/g047 

m047.pep    MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA  60
            |||||||  | |||||||||||||||||||||||||||||||||||||||||||||||||
g047        MVIIQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA  60

m047.pep    AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI  120
            |||||||||||||||||       ||| ||||| ||||||||||||||||||||||||||
g047        AAENIGAVIPELRPKETSTRR---IMIAGGGNICYRLAKQLEHAYNVKIIECRPRRAEWI  117

m047.pep    AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV  180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g047        AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV  177

m047.pep    NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK  240
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g047        NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK  237

m047.pep    TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL  300
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g047        TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL  297

m047.pep    EKLIQVKMGFFGX  313
            |||||||||||||
g047        EKLIQVKMGFFGX  310



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 

g048.seq 

1 ATGCTCGACA AAGGCGAGGA GTTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTc ggcCCcgTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCCG 

101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTTACCCG CCAAATGCTC 

151 AAACAAACCG GCCTCTTGGG CATGATCGGC AAATCCGagc gcgGcgcggc 

201 cacctGCGAA GCcatCGCCG ACAACAAGGC CGTGTACCTC ATGGCAGTCG 
251 GCGGCGCGGC ATACCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGTAT GGAAGCCGTT TACGAATTTG AAGTCAAAGA 

351 TATGCCCGTA ACCGTCGCCG TGGACAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAGTCT 
451 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048.ng>: 
g048.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNKAVYL MAVGGAAYLV AKAIKSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 201>: 

m048.seq 

1 ATGCTCAACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 



101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 
351 CATGCCCGTA ACCGTCGCCG TAGATAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAATCT 
451 TGA 



This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m048.pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KSERGVATCE AIADNKAVYL MAVGGAAYLV AKAIKSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a048.seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

451 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048.a>: 

a048.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGAATCE AIADNKAVYL MAVGGAAYLV AKAIKSSKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 
151 * 

m048/a048 96.0% identity over a 150 aa overlap 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

M : I I I I J I I I I I E I I I f I I I I I I I ! I I I : I t I I I I I I II I ! I I I I i I I I I I I I 

a04 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

I I i I I : I I I I I I I I I I M I t I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I II || | | | 
a048 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

70 80 90 100 110 120 

130 140 150 

m04 8 .pep TVAVDSKGES IHATAPRKWQAKIGIIPVESX 

I I I I ! I M I I I I I II I : I II I I I I I I I : I I 
a 0 4 8 TVAVDSKGES IHATAPPQWQAKI GI I PVKSX 

130 140 150 
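
The identity figure quoted for an alignment without gaps can be checked by counting the positions at which the two peptides carry the same residue. The sketch below does this for m048.pep and a048.pep as listed above. The names `percent_identity` and `mismatch_positions` are hypothetical helpers, and the code assumes the two sequences are already aligned position for position; it is not the alignment program that produced the comparisons in this document and does not handle gaps.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions with identical residues in two equal-length,
    already aligned (gap-free) sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def mismatch_positions(a: str, b: str) -> list[int]:
    """1-based positions at which the two aligned sequences differ."""
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


# Residues 1-150 of m048.pep and a048.pep (stop codon omitted).
M048 = (
    "MLNKGEELPV" "DFTNRLIYYV" "GPVDPVGDEV" "VGPAGPTTAT" "RMDKFTRQML"
    "EQTDLLGMIG" "KSERGVATCE" "AIADNKAVYL" "MAVGGAAYLV" "AKAIKSSKVL"
    "AFPELGMEAI" "YEFEVKDMPV" "TVAVDSKGES" "IHATAPRKWQ" "AKIGIIPVES"
)
A048 = (
    "MLDKGEELPV" "DFTNRLIYYV" "GPVDPVGDEI" "VGPAGPTTAT" "RMDKFTRQML"
    "EQTDLLGMIG" "KSERGAATCE" "AIADNKAVYL" "MAVGGAAYLV" "AKAIKSSKVL"
    "AFPELGMEAI" "YEFEVKDMPV" "TVAVDSKGES" "IHATAPPQWQ" "AKIGIIPVKS"
)

print(round(percent_identity(M048, A048), 1))
print(mismatch_positions(M048, A048))
```

For these two peptides the count is 144 identical residues out of 150, which agrees with the 96.0% identity stated for the m048/a048 comparison.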

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 

m048/g048 

m04 8 .pep 
g04 8 



10 20 30 40 50 60 

MLNKGEELPVDFTNRLIYWGPVDPVGDEWGPAGPTTATI^DKFTRQMLEQTDLLGMIG 

IhMIIIIMIMIIilMIMIIIIMiMIIIMiilMMMllihM IIIIM 

MLDKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKPTRQMLKQTGLLGMIG 
10 20 30 40 50 60 



70 80 9 ° 100 HO 120 

m048.pep ^ERGVATCEAI ADNKAVYLMAVGGAAYLVAKAI KSSKVLAFPELGMEAI YEFEVKDMPV 

lllll * I I I I I I I I I I I I I I I I I I | | | | | | | | | | | : | | | | | | | | | | 

KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEV 

70 80 SO 100 no 120 



g048 



130 140 150 

m04 8 .pep TVAVDSKGESIHATAPRKWQAKIGI IPVESX 

I M M M II ! 1 1 [ I f] M 1 1 1 1 [ 1 M I M M 

9° 4 8 TVAVDSKGESIHATAPRKWQAKIGI IPVESX 

130 140 150 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 205>: 

g049.seq 

1 ATGCGGGCGC AGGCGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGCCTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCGG ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 

201 CAATCTGTGC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCGAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAAccggca tTTGCAGGGA 

301 AGCCTgcgcg TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCgc tGTTGTGTTC TTCCTGCCAT 

401 TTCTTCAGAT ACGCCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 206; ORF 049.ng>: 

g049.pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGAVVF FLPFLQIRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 207>: 

m049.seq (partial) 

1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGTTTC TTCCGCATCG TTTTCCCCGT TTTCCGAAAC 

151 CGCCGGCTCA TTCGTGCCGG ATTCTGCCTC GTCGGCGTTT TCCCCGCTTT 

201 CAATCTGTCC GGTTTCAAAT TCGACACTGT CTTTTTTGGT ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

401 TTTTTCAGAT ACGCCTT... 



This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049.pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL. 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 






1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

401 TTTTTCAGAT ACGCCTT 

This corresponds to the amino acid sequence <SEQ ID 210; ORF 049.a>: 

a049.pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL 

m049/a049 90.6% identity over a 139 aa overlap 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I I I M I I I I I I I I I I t I I I I M I I I I I M I : I M I I I I I I I I : I I I I j I : | | 

a04 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQN I DLDGHQRFFRTAFAVFRN PVCRRTRFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVE PVFLKDDHRVGFDFL 

• I I I I I I I II I I I I II I I II M I ! I I I I I I I I M I I I I I I I I | | | | || I I M I I I I I I | 
a04 9 I GVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVE PVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 

I i I I I I I I I I I I I I I I I I I 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDOPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I I I M M M M u I II I II 1 1 1 1 1 1 1 1 I I i i| | M IN hi :| III hIM 

g049 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRLFRTAFAVFRNPVCRRTGFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m049 .pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVE PVFLKDDHRVGFDFL 

Hllihll llll IIMIhllllllllllllllllllllllllllllllMIIIIII 

9049 IGVFPALNLCGFKFGTVFFGIEPDSPPRFDVFFRNRHLQGSLRVE PVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 
M I I lh:| I :| ||:|| || 
g 0 4 9 AAI GNGAWFFLP FLQI RLX 

130 140 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 211>: 

g050.seq 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG ccCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 
251 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

301 AACTGTGCcg ccacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtcc 
351 TGTCGAactc acgccGCcgc gtgtCGAAGA CTGA 

This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

g050.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 213>: 

m050.seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C.agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTCCCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 

This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

m050.pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 215>: 

a050.seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050.a>: 

a050.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 

m050/a050 97.7% identity over a 129 aa overlap 

                    10        20        30        40        50        60
m050.pep    MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF
            |||||||||||||||||  |||||||||||||||||||||||||||||||||||||||||
a050        MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m050.pep    EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a050        EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL
                    70        80        90       100       110       120

                   130
m050.pep    TPPRVEDGPIX
            ||||||| |
a050        TPPRVEDWP

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

m050 . pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

MIMIIMIMIIIM: IIIIIMIIIIIIIIIMMIMIIIIMMIIIIIIIIll 

g050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050.pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

M M I M I i I I ) M I II , 1 1 I I i II IM M M IMi I I 1 1 I I Mi M I i I I I i I m I i I 

g050 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

m050.pep TPPRVEDGPIX 

IMIMI 

g050 TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 217>: 

g050-1.seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAGG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CATACCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 

1401 TTACGAATTT GAAGTCAAAG ATATGCCCGT AACCGTCGCC GTGGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAGTC TTGA 

This corresponds to the amino acid sequence <SEQ ID 218; ORF 050-1.ng>: 

g050-1.pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED*PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLIYYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 


g050-1/P14407 

sp|P14407|FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FUMARASE) 
>gi|280063|pir||B44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-dependent - Escherichia coli 

Identities = 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4%) 

Query : 11 QSICDAFQFI SYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 

« Q+ DA + H K L+ E + K Q L NS + A+ P CQDTG 

Sbjct : 53 QAFHDASFMLRPAHQKQVAAILHDPEASEND— KYVALQFLRNSEIAAKGVLPTCQDTG 109 

Query: 71 IATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + KG V W E+ +++GV YEN + A K NT N PA 

Sbjct: 110 TAIIVGKKGQRV-WTGGGD-EETLSKGVYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

Query: 131 VIHMSIVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNIVDWVLKTIPTMGAGWCP 185 

I + V G + + C AKGGGS NK+ L A+L P + +++++ + T+G CP 

Sbjct: 167 QIDLYAVDGDEYKFLCVAKGGGSANKTYLYQETKALLTPG-KLKNFLVEKMRTLGTAACP 225 

Query: 186 PXXXXXXXXXTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEKVNXXX 245 

P T + L + +H EL + + L EL E+ 

Sbjct: 226 PYHIAFVIGGTSAETNLKTVKLASAHY-YDELPTEGNEHGQAFRDVQLEQELLEEAQKLG 284 

Query: 246 XXXXXXXXXTTVLDVKILDYPTHAASKPIAMI PNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ +++ G +E P 

Sbjct: 285 LGAQFGGKYFAH-DIRVIRLPRHGASCPVGMGVSCSADRNIKAKINREGIWIEKLEHNPG 343 

Query: 302 RVEDXPDLTYSPDNGKRVDVDKLTKE EVASWKTGDVLLLNGKILTGRDAAHKRLVNM 358 

eK . _ 0 „„ + +VD+++ KE +++ + L L G 1+ GRD AH +L + 

Sbjct: 344 QYIPQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTIIVGRDIAHAKLKEL 403 

Query: 359 LDKGEELPVDFTNRLI YYXXXXXXXXXXXXXXXXXTTATRMDKFTRQMLKQTGLLGMIGK 418 

+D G+ELP + IYY TTA RMD + + G + M+ K 

Sbjct: 404 IDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAK 463 

Query: 419 SERGAATCEAIADNKAVYLMAVGG-AAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 477 

R +A + YL ++ GG AA L ++IK + +A+PELGMEA+++ EV+D P 

Sbjct: 464 GNRSQQVTDACHKHGGFYLGSIGGPAAVLAQQSIKHLECVAYPELGMEAIWKIEVEDFPA 523 

Query: 478 TVAVDSKG 485 

+ VD KG 
Sbjct: 524 FILVDDKG 531 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 219>: 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 






301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050-1>: 

m050-1.pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLIYYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-1/g050-1 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

mOSO-l.pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
I I 1 I I I I I I ! II I I I I t I I 1 I I I! I I I I I I II I I I 1! I I I I t I I I I I I I I I t I I I I I I I I 
g050-l MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050-l . pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I I I I I I I I I I I I I : I I I M I ! 1 I I : I I I I I I I I I I I I I I I I ! I I I I I I I I I I I 
g050-l NNRPICQDTGIATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m050-l . pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I I I I I I I I I I 1 I I I I I I I I M I I I t I I II 1 I I I I I I 11 I I I I I I I I I I II I I I I I I 
g050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l . pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I I I I I I I I I I I I I I I I M I I I I I I I I i I ! I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I t 
g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m050-l . pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I I I I I M 11 I I I I I I I I I I I I I I I I 1 t I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | 
g0 5 0- 1 VNALG I GAQGLGGLTTVLDVKI LD YPTHAASKPI AMI PNCAATRH VEFELDGSGPVELT P 

250 260 270 280 290 300 
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310 320 330 340 350 360 

m050-l.pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 

1 1 1 1 ] n 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 n 1 1 1 1 n 1 1 1 n 1 1 1 1 1 n 1 1 1 1 1 i 1 1 1 : 1 1 : 

g050-l PRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 
310 320 330 340 350 360 

370 380 390 400 410 420 

mOSO-l.pep KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
"IIIIIIHIIIIIIIM|IMIIIIIIIHHIMIIII||IH|:|| II I IN || | 
g050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

mOSO-l.pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 
ftert , 1 ' : ' I I I I I I I I H I I t I I I I I II I I I I i | 1 | (I I I I I I | | | | | | | : 1 | | | | | | | ! |( j j' 

g050-l RGAATCEAIADNKAVYLMAVGGAAYLVAKA I KSSKVLAFPELGMEAV YEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
m050-l .pep VDSKGESIHATAPRKWQAKIGIIPVESX 
i M I I I I I I I I I | I | | | | | | | | | | | | [ | 
g050-l VDSKGESIHATAPRKWQAKIGIIPVESX 

490 500 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 221>:

a050-1.seq

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA

401 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC

451 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-1.a>:

a050-1.pep

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLIYYVGPV DPVGDEIVGP AGPTTATRMD 

401 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI

501 GIIPVKS*

a050-1/m050-1 98.4% identity in 507 aa overlap

                    10        20        30        40        50        60
a050-1.pep  MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m050-1      MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE
                    10        20        30        40        50        60

                    70        80        90       100       110       120
a050-1.pep  NNRPICQDTGIATVFLKVGMDVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK
            ||||||||||||||||||||:|||||||||||||||||||||||||||||||||||||||
m050-1      NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK
                    70        80        90       100       110       120

                   130       140       150       160       170       180
a050-1.pep  RQNTKDNTPAVIHMSIVPGDKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG
            ||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
m050-1      RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG
                   130       140       150       160       170       180

                   190       200       210       220       230       240
a050-1.pep  AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m050-1      AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK
                   190       200       210       220       230       240

                   250       260       270       280       290       300
a050-1.pep  VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m050-1      VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP
                   250       260       270       280       290       300

                   310       320       330       340       350       360
a050-1.pep  PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||:
m050-1      PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN
                   310       320       330       340       350       360

                   370       380       390       400       410       420
a050-1.pep  KGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE
            ||||||||||||||||||||||||||:|||||||||||||||||||||||||||||||||
m050-1      KGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE
                   370       380       390       400       410       420

                   430       440       450       460       470       480
a050-1.pep  RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA
            ||:|||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m050-1      RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA
                   430       440       450       460       470       480

                   490       500
a050-1.pep  VDSKGESIHATAPPQWQAKIGIIPVKSX
            ||||||||||||| :||||||||||:||
m050-1      VDSKGESIHATAPRKWQAKIGIIPVESX
                   490       500



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 
g052.seq

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG
51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC
101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC
151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC
201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA
251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC
301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA
351 CGCCGCCTGA

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052.pep

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP
51 KGLDGVSKNS SLVLALTAAF HSFISVGDTR LTPMPNLVTM LLIKPTVVPN
101 RLRLETTWSP ACRKVKNAA*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>:

m052.seq

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052.pep

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP
51 KGLDGVSKNS SLVLALTAAF HSFISVGDTR LTPMPNLVTM LLIKPTVVPN
101 RLRLETTWSP ACRKVKNAA*
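
The correspondence between a nucleotide listing and its amino acid listing can be checked by conceptual translation. The sketch below is illustrative only and assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and are not taken from the disclosure. Codons containing an ambiguous base are rendered as `X`, and stop codons as `*`, following the notation of the listings.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and digits are ignored."""
    seq = "".join(ch for ch in dna.upper() if ch.isalpha())
    residues = []
    for i in range(0, len(seq) - 2, 3):
        # Unknown or ambiguous codons are reported as X rather than guessed.
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)


# First row of m052.seq as listed above.
row = "ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG"
print(translate(row))  # MALVAEETEISAPCFK
```

The translated residues agree with the start of m052.pep; the two trailing nucleotides of the row belong to a codon completed on the next row and are therefore not translated here.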



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>:

a052.seq

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 228; ORF 052.a>: 

a052.pep

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP
51 KGLDGVSKNS SLVLALTAAF HSFISVGDT* LTSMPNLVTM LLIKPTVVPN
101 RLRLEITWSP ACKKVKNAA*



m052/a052 95.8% identity over a 119 aa overlap

                    10        20        30        40        50        60
m052.pep    MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS
            ||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
a052        MALVAEETEISAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m052.pep    SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTVVPNRLRLETTWSPACRKVKNAAX
            ||||||||||||||||||| || |||||||||||||||||||||| ||||||:|||||||
a052        SLVLALTAAFHSFISVGDTXLTSMPNLVTMLLIKPTVVPNRLRLEITWSPACKKVKNAAX
                    70        80        90       100       110       120
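
The identity figures quoted for each comparison can be reproduced from the aligned rows. The sketch below is a hedged illustration, not the program used to generate the alignments: the function name `percent_identity` is hypothetical, and it assumes that every alignment column counts toward the overlap and that a column containing a gap is never scored as an identity.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of alignment columns holding the same residue in both rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A gap paired with anything (even another gap) is not an identity.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy example: nine identical columns out of ten.
print(percent_identity("MALVAEETEI", "MALVAEETEL"))  # 90.0

# The m052 and a052 listings differ at five of 119 positions.
print(round(100.0 * 114 / 119, 1))  # 95.8
```

Under these assumptions the five differing positions between m052.pep and a052.pep give the 95.8% figure stated for that comparison.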



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 052 shows 95.8% identity over a 119 aa overlap with a predicted ORF (ORF 052.ng)
from N. gonorrhoeae:

m052/g052

                    10        20        30        40        50        60
m052.pep    MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g052        MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m052.pep    SLVLALTAAFHSFISVGDTWLTSMPNLATMLLIKPTVVPNRLRLEITWSPACKKVKNAAX
            ||||||||||||||||||| || ||||:||||||||||||||||| ||||||:|||||||
g052        SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTVVPNRLRLETTWSPACRKVKNAAX
                    70        80        90       100       110       120




The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>:

g073.seq

1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT
51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA
101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC
151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC
201 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT
251 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG
301 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT
351 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073.ng>:

g073.pep

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT
51 SPGRIPAASF SSGCILPCVV VHGLVMVERT SPRLAVREKS STTFHAAAWS
101 ATSKPMTMPP PFCCLRISSA CGWSGNPV*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 231>:

m073.seq

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT
51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA
101 TCAAATCGCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATCCGGGG
151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT
201 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC
251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCCGC TTCTTGATCG
301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT
351 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>:

m073.pep

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG
51 RMLAASFSSG CILPCVVVHG WVMVERTSPR LAVREKSSTP STTFHAASXS
101 ATSKPMTMPP PFCCLRISAA XGWSDNPV*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 233>:

a073.seq

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT
51 TCCGCCGATG CCGTCTGAA. CACGCAATCA GCGCGCGAGT GCCTGTTTCA
101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC
151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC
201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT
251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT
301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG
351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073.a>:

a073.pep

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT
51 NPGRMLAASF SSGCILPCVV VHGWVMVERT SPRLAVREKS STPSTTFHAA
101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V*

m073/a073 92.3% identity over a 130 aa overlap

                    10           20        30        40        50
m073.pep    MCMPYKIRVSDGICC---PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF
             || ||||||||||    ||||| ||||||||||||||||||||||||||||||||||||
a073        TCMSYKIRVSDGICGVFPPMPSEXRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF
                    10        20        30        40        50        60

             60        70        80        90       100       110
m073.pep    SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI
            ||||||||||||||||||||||||||||||||||||||||: ||||||||||||||||||
a073        SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAAAWSATSKPMTMPPPFCCLRI
                    70        80        90       100       110       120

            120      129
m073.pep    SAAXGWSDNPVX
            |:||||| ||||
a073        SSAXGWSGNPVX
                   130

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073.ng)
from N. gonorrhoeae:

m073/g073

                    10           20        30        40        50
m073.pep    MCMPYKIRVSDGICC---PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF
            ||||| ||||||||    ||||||||||||||||||||||||||||||||:|||: ||||
g073        MCMPYAIRVSDGICRIFPPMPSETRNQRASACFKSSIKSPTYSKPTDRRTSPGRIPAASF
                    10        20        30        40        50        60

             60        70        80        90       100       110
m073.pep    SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI
            ||||||||||||| ||||||||||||||||||   |||||: ||||||||||||||||||
g073        SSGCILPCVVVHGLVMVERTSPRLAVREKSST---TFHAAAWSATSKPMTMPPPFCCLRI
                    70        80        90          100       110

            120      129
m073.pep    SAAXGWSDNPVX
            |:| ||| ||||
g073        SSACGWSGNPVX
            120



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 235>:

g075.seq

1 ATGCCGCCTT ACTTCATCAC CCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTTTCGG GCAACGCTGC GTTTGCCTGT 

151 GCCGCCAAAG CCAGCGGGGC GGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTT ACGAAATTTT 

251 TAAAAAAATG TGTTTGCGGG CTTTGTGAAG GTTTTAGAGA CCGCCTGCCG 

301 GGCCTCTTAA ACTTAATCTT CTTTTTCGTA GAATCCGAAA ATTACAAATT

351 CCCCGCCTAT CTCTTCCAAT GCCGAGCTAA AAGCGTCTTC ATAGCTGTCA

401 TATTTACCGG CTGA

This corresponds to the amino acid sequence <SEQ ID 236; ORF 075.ng>:

g075.pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF IAVIFTG*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 237>:

m075.seq

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAATA CAAAAAGCGC

51 GGCGAAAATG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

m075.pep

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC
51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK
101 SVFQKSEYDK FVLVADFFQT CVNRFFEVVE IIGIGD*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075.ng)
from N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPS YFITLLTMENTKSAAKMPTTIQPAS I PSAFAASKAFFAVSGNVAFACAAKARGAAVT 

II MIIMIIMIIIMI II i I I I M M II il III II II Ml I : M i II! M Mill 

MPPYFITLLTMENTKSAAKTPTTIQPAS I PSAFAASKAFFAVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 

TASFAPYLRQVLINFMIFSF KKCLAVMDGAFFRRPPNI RKSVFQKSEYDKFVLVAD 

MM MM M Mill I MM II 1= = ' I I | = = : M | : = : : i 

TASFAPYLRQVLINFMI FSFTKFLKKCVCGLCEGFRDRLPGLLNLI FFFVESENYKFPAY 
70 80 90 100 110 120 

120 130 
FFQTCVNRF FE WE 1 1 G I GDX 

Ml := I M : I 
LFQCRAKSVFI AVI FTGX 
130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 239>: 

a075.seq

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075.a>:

a075.pep

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK

101 SVFQKSEYDK FVLVADFFQT CVNRFFEVVE IIGIGD*

m075/a075 98.5% identity over a 136 aa overlap

                    10        20        30        40        50        60
m075.pep    MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT
            ||||||||||||:|||||| ||||||||||||||||||||||||||||||||||||||||
a075        MPSYFITLLTMEKTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m075.pep    TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a075        TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT
                    70        80        90       100       110       120

                   130
m075.pep    CVNRFFEVVEIIGIGDX
            |||||||||||||||||
a075        CVNRFFEVVEIIGIGDX
                   130

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 241>:

g080.seq

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT
51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT
101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT
151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA
201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAAGCCTAC CGCCGGTATC
251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG
301 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT
351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA
401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT
451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA
501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA
551 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT
601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA
651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG
701 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata
751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA
801 GGGCAGacga acaatggaac AGcagtaa

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>:

g080.pep

1 MWDNAEAMER LTRWLLVMMA MLLAASGLVW FYNSNHLPVK QVSLKGNLVY
51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE
101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR
151 YDEFSTVLAK QGLGIKEMTY TARSAWNVVL DNGITVRLGR ENEMKRLRLF
201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI
251 LRPGVGNGST QISISYKGRR TMEQQ*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 243>:

m080.seq

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT
601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA
651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG
701 GTTTACCCGA AAAAGAATCC GAAGAATAG

This corresponds to the amino acid sequence <SEQ ID 244; ORF 080>:

m080.pep

1 MWDNAEAMER LTRWLLVMMA MLLAASGLVW FYNSNHLPVK QVSLKGNLVY
51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE
101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR
151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF
201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from N. gonorrhoeae: 

m080/g080

                    10        20        30        40        50        60
m080.pep    MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
g080        MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m080.pep    KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g080        KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m080.pep    EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
g080        EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWNVVL
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m080.pep    DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES
            |||||||||||||||||||||||||||||||||||||||||||||||| :| ||||||||
g080        DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVPHAPDGLPEKES
                   190       200       210       220       230       240

m080.pep    EEX
            ||
g080        EEYWEQVWDILRPGVGNGSTQISISYKGRRTMEQQX
                   250       260       270



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 

a080.seq

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080.pep 

1 MWDNAEAMER LTRWLLVMMA MLLAASGLVW FYNSNHLPVK QVSLKGNLVY

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE

101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR

151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

m080/a080 99.2% identity over a 242 aa overlap

                    10        20        30        40        50        60
m080.pep    MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
a080        MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m080.pep    KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a080        KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m080.pep    EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a080        EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m080.pep    DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES
            ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
a080        DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPEKES
                   190       200       210       220       230       240

m080.pep    EEX
            |||
a080        EEX

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>:

g081.seq

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT
51 GCCGTCTGAA AACAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGATA
101 TTCGGGAAGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGACGCG
151 CATGATTTTG TTGGAGGCGT ATTGTCTGCG GGCGCGGCGG CGGTTGTGGT
201 TTCGCGCGAA GATTGCGCGG CTTTGGGCGG CGCGTTGAAA GTCGATGACA
251 CGCTTGCCGC GTTGCAAACG TTGGCGAAGG CGTGGCGCGA TAATGTGAAC
301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA
351 GATGCTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATGAT GCCGTTTCGG
401 CGACGGCAGG CAACTTCAAC AACCACAtcg gaTTGCCGCT GACTTTATTG
451 AAATtaaAcg aAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA
501 TTTTGGcgaa ctggcggtTt taacgcaaaT CGCCAAACCC GATGCCGCTT
551 TGGtcaACAA CGCCCTGCGC GCCCATGTCG GATGCGGTTt cgacggagtg
601 GGCGATATTG CCAAAGcgaa aagcGAGATT TatgcagGct tATGTTCAGA
651 CGGCATGGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA
701 CGGCAACGTT TAATTTGAAT ACGTGCACTT TCGGCGTCGA TAGCGGCGAT
751 GTCCGCGCGG AAAATATCGT GCTGAAACCT TTGTCGTGCG AATTTGATTT
801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC
851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA
951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG
1001 ATACTTATAA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG
1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT
1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC
1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA
1201 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC
1251 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG
1301 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG
1351 GATAAGTga

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081.ng>: 

g081.pep

1 MKPLDLNFIC QALKLPMPSE NKPVSRIVTD SRDIREGDVF FALAGGRFDA
51 HDFVGGVLSA GAAAVVVSRE DCAALGGALK VDDTLAALQT LAKAWRDNVN
101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL
151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCGFDGV
201 GDIAKAKSEI YAGLCSDGMA LIPQEDANMA VFKTATFNLN TCTFGVDSGD
251 VRAENIVLKP LSCEFDLVCG DERTAVVLPV PGRHNVHNAA AAAALALAAG
301 LSLNDVAEGL QGFSNIKGRL NVKAGIKGAT LIDDTYNANP DSMKAAVDVL
351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE
401 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEVVEALE
451 DK*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 249>: 

m081.seq

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT
51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA
101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG
151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT
201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA
251 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT
301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA
351 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG
401 CGACGGCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG
451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA
501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT
551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG
601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA
651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA
701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT
751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATTTGATTT
801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC
851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA
951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG
1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG
1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT
1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG
1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC
1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC
1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA
1301 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG
1351 GCATTGGAGG ATAAGTGA

This corresponds to the amino acid sequence <SEQ ID 250; ORF 081>:

m081.pep

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA
51 HDFVEDVLAA GAAAVVVSRE DCAAMDGALK VDDTLAALQT LAKAWRENVN
101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL
151 KLNEKHRYAV IEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCGFDGV
201 GDIAKAKSEI YQGLCSDGIA LIPQEDANMA VFKTATLNLN TRTFGIDSGD
251 VHAENIVLKP LSCEFDLVCG DERAAVVLPV PGRHNVHNAA AAAALALAAG
301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAIDVL
351 ARMPAPRIFV MGDMGELGEL GEDEAAAMHA EVGAYARDQG IEAAYFVGDN
401 SVEAAEKFGA DGLWFAAKDP LIQVLRHDLP ERATVLVKGS RFMQMEEVVE
451 ALEDK*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF 081.ng) 
from N. gonorrhoeae: 

m081/g081 

10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFAIiAGERFDAHDFVEDVLAA 

MMMMIMIilillllhlMIIIIMMill IMIIIIM MINIM MM 

9 081 MKPLDLNFICQALKLPMPSENKPVSRIVTDSRDIREGDVFFAliAGGRFDAHDFVGGVLSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m08 1 . pep GAAAVWSREDCAAMDGALKVDDTLAALQTLAK^ 

MINIMI IN lh 1 1 1 M 1 1 ! I I 1 1 1 1 ! M f M ^ M I II 1 1 1 1 1 1 i I ! 1 1 i I 1 1 M I 

g081 GAAAVVVSREDCAALGGALKVDDTLAALQTLAKAWRDNVNPFVFGITGSGGKTTVKEMLA

70 80 90 100 110 120 

130 140 150 160 170 180 

m081.pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP

1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 11 ! I j I M { 1 1 I f I 1 1 f 1 f f I i 1 1 1 1 1 [ I 1 1 i 1 1 1 1 Nil 

g081 AVLRRRFGDDAVSATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP
130 140 150 160 170 180



190 200 210 220 230 240

m081.pep NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN

: 1 1 1 M II M 1 1 1 II I II 1 1 M II I N I M I IINIIMIIIMIIIIIMIIMMM 

g081 DAALVNWALRAHVGCGFDGVGDIAKAKSEIYAGLCSDGMALIPQEDANMAVFKTATFNLN

190 200 210 220 230 240

250 260 270 280 290 300 

m081.pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG

i I 1 1 : 1 1 I II M II II 1 1 1 1 II 1 1 II II I II I M I II I 1 1 N I II I 1 1 N I I II M II I 

g081 TCTFGVDSGDVRAENIVLKPLSCEFDLVCGDERTAVVLPVPGRHNVHNAAAAAAIALAAG
250 260 270 280 290 300



310 320 330 340 350 360 

m081.pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV

M 1 1 11 1 1 II = 1 1 1 f 1 1 1 j 1 1 M = 1 1 M i M M I f M 1 1 1 1 1 1 1 1 ! = 1 1 M 1 1 1 [ 1 1 1 1 1 

g081 LSLNDVAEGLQGFSNIKGRLNVKAGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV
310 320 330 340 350 360



370 380 390 400 410 420

m081.pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP

I INI III I N I II 1 1 II II II I M II II 1 1 II N II II II M 1 1 II II 1 1 1 II II I 

g081 MGDMGELGE---DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP

370 380 390 400 410

430 440 450 

m081.pep LIQVLRHDLPERATVLVKGSRFMQMEEVVEALEDKX

N II I MMMMNNNINIMINMIMN 

g081 LIQVLSHDLPERATVLVKGSRFMQMEEVVEALEDKX
420 430 440 450

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 251>:

a081.seq

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT GCCGTTTTGG 

401 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG

451 AAATTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

751 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTGA 



This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081.pep

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAVVVSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAVVLPV PGRHNVHNAA AAAALSLAAG

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEVVEALE

451 DK* 



m081/a081 96.7% identity over a 455 aa overlap



10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 
i M I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
a081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m081.pep GAAAVVVSREDCAAMDGALKVDDTLAALQTLAKAWRENVNPFVFGITGSGGKTTVKEMLA

I I 1 I I I I M I I I : I I I I I i I 11 I I I : M I I I I I I I I I E I I I I I i I I I I I I I I I I I I I I I 
a081 GAAAVVVSREDCVAMDGALKVDDTLTALQMLAKAWRENVNPFVFGITGSGGKTTVKEMLA

70 80 90 100 110 120 

130 140 150 160 170 180 

m081.pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP

I I I I I I I I I : I I I I I I I II I M I I I I I I I I M I I I I I I I M I I I I I I I I II I I I I I I II 
a081 AVLRRRFGDNAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP
130 140 150 160 170 180

190 200 210 220 230 240 

m081 . pep NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 
: M I I I II I I I I I I M I I I I I I I j I I I M I I I I I I I I I : I I M I I I I I I I i I I I I I I I I I 
a081 DAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGMALIPQEDANMAVFKTATLNLN

190 200 210 220 230 240 

250 260 270 280 290 300 

m081.pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG

If I II I I I II I I I I I I I I I i I j I : I M I I I I M I M I I M I : I i I I 

a081 TRTFGIDSGDVHAENIVLKPLSCEFDLVCGNECAAVVLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 

310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 
I ' I I I I J I I If I I M I II I I I I ! I I I I I I I I I | | | | | | | | | | | J 1 | : | | | | j | | | | j j | | 
a081 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV

310 320 330 340 350 360 

370 380 390 400 410 420 

m081.pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP

M I I I I I I I II I II M I I I M I I II I I | M I I II I i I M I I j | I | i | | | | | | | M I I 

a081 MGDMGELGE---DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP

370 380 390 400 410 

430 440 450 

m081.pep LIQVLRHDLPERATVLVKGSRFMQMEEVVEALEDKX

M I I I I I I I I I I II I I I I I I I I I I I I I I | | | | | 1 | | 
a081 LIQVLRHDLPERATVLVKGSRFMQMEEVVEALEDKX

420 430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 253>: 

g082.seq 

1 aTGTGGTTGT TGAAGTTGCC TGCCGTCGCC GAAACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATCTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGGT TCACATTATC GCGCCACGCC 

151 TTCGCCAACG TTTGCAACGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 

201 GCCCAAAGCC GCGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCGCCCG 

251 CAGACAATAC GCCTCCAACA AAATCATGCG CGTCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC TTCCCGAATA TCGCGGCTGT CGGTTACGAT
351 GCGCGACACG GGTTTGTTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTGCTTTCGT TAATATTCGG
451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 
501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 
551 TATCATTTTT TAGACGTATT TTTAGCCGAT TTGCCTTTTC CCGCATACCA 
601 CGGCGCGGGG TCGTCGGACT GTCTGTCGAT AAAGGCAAGG TTATTGCCTT 
651 CGCCCGGCAC ATCGGGGACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 
701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082.pep

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

151 AADTSVAADF FIACFAVVKH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP

201 RRGVVGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 255>:

m082.seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 






201 ATCCATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m082.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR

151 AADTSVAADF FIACFAVVKH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP

201 RRGVVGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng)
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

m082.pep MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV

I IMMIIhMIIMIIIIIIIMMIIMIIIMIMIIIIhlMllhMIIIII 

g082 MWLLKLPAVAETASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTLSRHAFANVCNAASV

10 20 30 40 50 60 

70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

Illllll llllllllllllill II MINIM 1 1 1 1 1 1 i M = 1 : 1 1 1 II 1 1 1 IS 

g082 SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPPANAKNTSPSRISRLSVTMRDT

70 80 90 100 110 120 

130 140 150 160 170 180 

m082.pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF

I I = 1 1 1 1 E ! I! 1 1 1 1 1 1 1 M M 1 1 = I i 1 1 f 1 1 1 M 1 M M I ! I ! I ! I 1 1 1 1 M 1 1 1 1 1 I 

g082 GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSAF

130 140 150 160 170 180 

190 200 210 220 230 240 

m082.pep FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT

II I ! 1 M I I I I I M M I III I I I III IIIMIIIIII I 1 1 = 1 I I 11 1 I 1 1 [ 1 1 M M 1 

g082 FLYVSFFRRIFSRFAFSRIPRRGVVGLSVDKGKVIAFARHIGDIPPKIIAVIGQLVGFDT

190 200 210 220 230 240 

m082.pep RPTAESAX

IIIMIII 

g082 RPTAESAX



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 257>:

a082.seq

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC
101 CCGAGCCGGT AATACCGAAC ACAAACGGGT TCACATTCTC GCGCCACGCC
151 TTCGCCAACA TTTGCAACGC GGTAAGCGTG TCATCGACTT TCAACGCGCC
201 ATCCATTGCA ACGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCCG
251 CAGCCAATAC GTCTTCAACA AAATCATGCG CATCAAACCG CCCGCCCGCC
301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT
351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC
401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG
451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT
501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG
551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA
601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT
651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC
701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082.a>: 

a082.pep

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA
51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA
101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR
151 AADTSVAADF FIACFAVVKH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP
201 RRGVVGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA*



m082/a082 95.5% identity over a 247 aa overlap 



10 20 30 40 50 60

m082.pep MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV
1 ! > I I M I I : I ! M M I I I I I I : I I I M I I I I M ! : j I E I 1 I I I j I I j I | : : | M : j j 
a082 MWLLKLPAVAKTALSPKRRRNTAANISFTVVLPPEPVIPNTNGFTFSRHAFANICNAVSV

10 20 30 40 50 60 

70 80 90 100 110 120 

m082.pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT
M M I I I I I I : I I I I M I I I I I M I I I I I I M I I I I I 1 I I I I I M I | | t I ! I I ! 1 I I I I 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTSPARMSRLSVTMRDT
70 80 90 100 110 120 

130 140 150 160 170 180

m082.pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF

i M I I I I I I I I I I I M I I M I t I I M I 1 II I If I I I I i I I I N | II 11 M I I 

a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSAF

130 140 150 160 170 180 

190 200 210 220 230 240

m082.pep FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT
11 11 ' 1 I > I I i I I I I f I M I I I I II 1 I 1 I I I I | | M 1 I II I I I I I I I | | ! | | 1 M I I! I I 
a082 FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT

190 200 210 220 230 240 



m082.pep RPTAESAX 
I M I I I I I 
a082 RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 259>:

g084.seq

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG
401 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT
451 AAACGTCGCA GCAAAATATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg
551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 
601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 
651 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 

This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084.pep

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASGYHLEY EYGYRYSAVG
51 ALASVVFLLL LARGFPRVSS VVLLIYVGTT ALYLPVGWLY GAPSYQIVGS
101 ILESNPAEAR EFVGNLPGSL YFVQALFFIF GLTVWKYCVS VGVFADVKNY
151 KRRSKIWLTI LLTLILSCAV MEKIAGDKDW REPDAGLLLN IFDLYYDLAF
201 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 261>:

m084.seq

1 ATGAAACAAT CCGCCcGAAT AAAa.ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC
351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT

451 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC
501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT.TC
601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 
651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

m084.pep

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXXYHPEY EYGYRYSAVG
51 ALASVVFLLL LARGFPRVSS VVLLIYVGTT ALYLPVGWLY GAPSYQIVGS
101 ILESNPAEAR EFVGNLPGSL YFVQALFFIF GLTVWKYCVS GGVFADVKNY
151 KRRSKIWLTI LLTLILSCAV MDKIASDKDL REPDAGLLLN IFDLYYDLAX
201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from N. gonorrhoeae:

m084/g084 

10 20 30 40 50 

m084 . pep MKQSARI KXMNQTLLYTLGICALLTF YHPEYEYGYRYSAVGALASWFLLL 

Illlllil hill lllilllhl II MMIM 1 1 1 1 M I II 1 1 Ml 

g084 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASVVFLLL

10 20 30 40 50 60 

60 70 80 90 100 110 

m084.pep LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL

Illlll I MM I I III I I lllllll I i I II I I M! I ! MM 111 ! MM I I i M I I I II I 

g084 LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL

70 80 90 100 110 120 

120 130 140 150 160 170 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

MM 1 1 I I Ml Ml II I !M MM II Ml III II IMIIMMI I M II MM Ml I 

g084 YFVQALFFIFGLTVWKYCVSVGVFADVKNYKRRSKIWLTILLTLILSCAVMEKIAGDKDW

130 140 150 160 170 180

180 190 200 210 220

m084.pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYPSCCAQVX

IIIMIMMIIMIIIII 1 1 1 ! 1 1 1 1 F M 1 1 1 1 1 1 I [ 1 1 i 1 j i Mill 

g084 REPDAGLLLNIFDLYYDLAFRAGTICRQARPHFGSSKKSVNMAYPPTCAQVX

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>:

a084.seq

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT

451 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 .GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084.pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASGYHLEY EYGYRYSAVG
51 ALASVVFLLL LARGFPRVSS VVLLIYVGTT ALYLPVGWLY GAPSYQIVGS
101 ILESNPAEAR EFVGNLPGSL YFVQALFFIF GLTVWRYCVS GGVFADVKNY
151 KRRSKIWLTI LLTLILSCAV MDKIASDKDL REPDAGLLLN IFDLYYDLAS
201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V*

m084/a084 92.2% identity over a 231 aa overlap 



10 20 30 40 50 60

m084.pep MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYHPEYEYGYRYSAVGALASVVFLLL
IMIIIII |:||| I I I M ! I I : ! II I M I i j I I I I I I I 1 I i M I | I I 

a084 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASVVFLLL
10 20 30 40 50 60

70 80 90 100 110 120

m084.pep LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL
M M M I M I I M I M M M I M I M M i M M I M M M I I 1 I I I I i I M M M I I II I 
a084 LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL
70 80 90 100 110 120 

130 140 150 160 170 180 

m084.pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL
Ml I I II I ! I if | m M II M I II I I I I I I | | || M I M I II I I M M I M I I N I I I i I 
a084 YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL

130 140 150 160 170 180 

190 200 210 220 230 

m084.pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYPSCCAQVX
M M II I I II I I i II I I I I || | M II I I I I M I I I I I M I II I II I I I II 
a084 REPDAGLLLNIFDLYYDLASXAGTICRQARPHFGSSKKSVNMAYPSCCAQVX

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 265>:

g085.seq

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA
51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085.ng>:

g085.pep

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 
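
The relationship between a DNA listing and the amino acid sequence it "corresponds to" can be checked with a short translation routine. The following is a minimal sketch only, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and are not part of the original disclosure. Codons containing ambiguous bases (such as `n`, `.`, `r` or `w` in the listings) are reported as `X`, matching the convention used in the peptide listings.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1; unknown codons become 'X'."""
    # Remove the spacing used in the printed listing and normalise case.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 30 nucleotides of g085.seq as printed above.
print(translate("ATGGGCAAAG GGCAGGACTT CACGCCCCTG"))  # MGKGQDFTPL
```

Applied to the first three groups of g085.seq, the sketch reproduces the first ten residues of g085.pep.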

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 

m085.seq

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 085>: 

m085.pep

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085.ng)
from N. gonorrhoeae: 

m085/g085 



10 20 30 40 50 60 

m085.pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE
! I 1 I I I II Ml I II ! II I I I ll II IIMI illl III I I hi | | :| | IIIIMMMI 
g085 MGKGQDFTPLRDALKDKAKGVFLIGVDAPQIRRDLDGCGLNLTDCVTLEEAVQTAYAQAE 

10 20 30 40 50 60 



70 80 90 

m085.pep AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX

IIIIIIIIIIIIIIIIIIMMIIIMIMIMII 

g085 AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX

70 80 90 
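
The identity figure quoted for an alignment can be recomputed directly from the two aligned sequences. The sketch below is illustrative only: it assumes a gap-free alignment of equal-length sequences (as is the case for m085/g085), and the function name `percent_identity` is hypothetical rather than taken from the analysis software used for this disclosure. The sequences are the 94-residue m085.pep and g085.pep listings given above, without the terminal stop.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned positions holding the same residue.

    Assumes the two sequences are already aligned, gap-free and of
    equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


M085 = ("MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGE"
        "AVQTAYAQAEAGDIVLLSPACASFDMFKGYAHRSEVFIEAFKAL")
G085 = ("MGKGQDFTPLRDALKDKAKGVFLIGVDAPQIRRDLDGCGLNLTDCVTLEE"
        "AVQTAYAQAEAGDIVLLSPACASFDMFKGYAHRSEVFIEAFKAL")

print(round(percent_identity(M085, G085), 1))  # 94.7
```

Five of the 94 positions differ, giving 89/94, which rounds to the 94.7% stated for ORF 085.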



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>:

a085.seq

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 



This corresponds to the amino acid sequence <SEQ ID 270; ORF 085.a>:

a085.pep

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap

10 20 30 40 50 60 

m085.pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE



1 1 1 1 I I I I M II I I : I I I I I I II II I I M M I M I I I I M I I I I I I I mm MINI 
a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE
10 20 30 40 50 60 



70 80 90 

m085 .pep AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 
M M I I I I ! I II I I I ! I M I I I M I I I I II M II 
a085 AGDIVLLSPACASFDMFKGYAHRSEVFIGAFKALX

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 271>:

g086.seq

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT

401 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT

451 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 

601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA

951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>:

g086.pep

1 MVVLMTAFGL LMIYSASVYL ASKEGGDQFF YLTRQAGFVV AGLIASGFLW
51 FLCRMRTWRR LVPWIFALSG LLLVAVLIAG REINGATRWI PLGPLNFQPT
101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR
151 RETLEMYGRF RAIILPIMLV AFGLVLIMVQ PDFGSFVVIT VITVGMLFLA
201 GLPWKYFFVL VGSVLGGMVL MITAAPYRVQ RVVAFLDPWK DPQGAGYQLT
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL
301 IFCYGWLVVR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG
351 ALPTKGLTLP LMSYGGSSVF FMLISMMLLL RIDYENRQKM RGYRVE*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>:

m086.seq

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACc 

301 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG
351 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT
401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT
451 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG
551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg.AtGTCCw ATGGCGGTTC

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>:

m086.pep

1 MVVLMTAFSL LMIYSASVYL ASKEGGDQFF YLTRQAGFVV AGLIASGLLW
51 FLCRMRTWRR LVPWIFALSG LLLVVVLIAG REINGATRWI PLGPLNFQPT
101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR
151 RETLEMYGRX RAIILPIMLV AFGLVLIMVQ PDFGSFVVIT VIAVGMLFLA
201 GLPWKYFFVL VGSVLGGMVL MITAAPYRVQ RVVAFLDPWK DPQGAGYQLT
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL
301 IFCYGWLVVR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SFFNIGVNIG
351 ALPXKGLTXP XMSXGGSSVF FMLISMMLLX RIDYENRRKM RGYRVE*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N. gonorrhoeae: 

m086/g086 



10 20 30 40 50 60 

m086.pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR

M 1 1 1 M I = 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 = 1 1 1 M 1 1 1 1 1 M 

g086 MVVLMTAFGLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGFLWFLCRMRTWRR

10 20 30 40 50 60 

70 80 90 100 110 120 

m086.pep LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE

III II II MM M IM II IMM MM ! M I IMMIMIMI 1 1! I M M I Ml I MM 

g086 LVPWIFALSGLLLVAVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE

70 80 90 100 110 120 



130 140 150 160 170 180 

m086.pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ

Ml S M I Ml M I IM M Mill II! I I MIMMIM 1 1 1 1 I I I I M I I I 1 1 1 1 1 1 1 

g086 VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ

130 140 150 160 170 180 



190 200 210 220 230 240 

m086.pep PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK

MM I I MM I Mill I MM I I I II II I I III I MUM II II I I I Ml i I I Ml M M 

g086 PDFGSFVVITVITVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK

190 200 210 220 230 240 



250 260 270 280 290 300 

m086.pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL

I Ml I II IM I I I Ml Mill! I I III I I I MM IMM I IM III I II I M llllll M 

g086 DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL

250 260 270 280 290 300 

310 320 330 340 350 360

m086.pep IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP

IIMIIIIIIiMMIIMlililllllllllllllM MMIIIililMhmH 

g086 IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP

310 320 330 340 350 360 

370 380 390 

m086.pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX

II IIIMMIIIIIIII IIIMIhllMMMI 

g086 LMSYGGSSVFFMLISMMLLLRIDYENRQKMRGYRVEX

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 275>:

a086.seq

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT

451 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086.a>:

a086.pep 

1 MVVLMTAFSL LMIYSASVYL ASKEGGDQFF YLTRQAGFVV AGLIASGLLW
51 FLCRMRTWRR LVPWIFALSG LLLVVVLIAG REINGATRWI PLGPLNFQPT
101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR
151 RETLEMYGRF RAIILPIMLV AFGLVLIMVQ PDFGSFVVIT VIAVGMLFLA
201 GLPWKYFFVL VGSVLGGMVL MITAAPYRVQ RVVAFLDPWK DPQGAGYQLT
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL
301 IFCYGWLVVR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG
351 ALPTKGLTLP LMSYGGSSVF FMLISMMLLL RIDYENRRKM RGYRVE*

m086/a086 98.0% identity over a 396 aa overlap

                 10        20        30        40        50        60
m086.pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a086     MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m086.pep LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a086     LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE
                 70        80        90       100       110       120

                130       140       150       160       170       180
m086.pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ
         |||||||||||||||||||||||||||| |||||||||| ||||||||||||||||||||
a086     VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ
                130       140       150       160       170       180

                190       200       210       220       230       240
m086.pep PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a086     PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK
                190       200       210       220       230       240

                250       260       270       280       290       300
m086.pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a086     DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL
                250       260       270       280       290       300

                310       320       330       340       350       360
m086.pep IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP
         |||||||||||||||||||||||||||||||||||||| |||||||||||||| |||| |
a086     IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP
                310       320       330       340       350       360

                370       380       390
m086.pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX
          || ||||||||||||||| ||||||||||||||||
a086     LMSYGGSSVFFMLISMMLLLRIDYENRRKMRGYRVEX
                370       380       390

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g087.seq

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACACAT 

51 TTTCCCAGCT CTGGCTGTGG CGGATTCATT GCGCGTGCGC GGTCATCATG

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGAATAC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 

251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGTTTTG TTACCTTTCC CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGCTTG TCCAACCGCC 

401 AcCTGTCGCg ctGGGCGAAA CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC

451 AGCCACGAAG GCGGTTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCGT TGGCACTGCT GCCTGAAGAG GTGCGCCCGC AGATGTACCA 

651 CCAGTCGGGG CGTAACAAGC TGGGCAATCT TCAGGCGGAT TATGACGCGT 

701 TGGGCGTGAA AGCGGAATGC GTGGAATTTA TTACCGACAT GGTGTCCGCC

751 TACCGTGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC

801 CGAGTTGACG GCGGCGGGGC TGGGCGCGTT GTTAGTGCCG TATCCTCACG

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTCATGGT GCAGGCAGAA 

901 GCGGGGCTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 AATCCTCGGC AGCCTCAACC GCGAAAAATG CCTCAAATGG GCGGAAAACG 

1001 CCCGTACGTT GGCATTGCCG CACAGCGCGG ATGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

g087.pep

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRVR GHHVIWLGSK DSMEERIVPQ

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVECVIGF

101 GGFVTFPGGL AAKLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILVVGGS LGADVLNKTV

201 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA

251 YRDADLVICR AGALTIAELT AAGLGALLVP YPHAVDDHQT ANARFMVQAE

301 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA

351 IACAA*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>:

m087.seq

1 ATGGGCGGTA AAACCTTTAT GCTGAwkkCG GGCGGAACGG GCGGACATAT 
51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 
101 TGATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGTAT CGTGCCGCAA 
151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 
201 CATCAAACGC AAACTGATGC TGCCGGTTAC TTTGTATCAA ACCGTCCGCG 
251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC
301 GGCGGCTTCG TTACCTTCCC CGGCGGTTTG GCGGCGAAGC TATTArGCGT 
351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 
401 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 
451 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 
501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 
551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA
601 CCGCATGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC ATATGTACCA
651 CCAATCGGGA CGGGGCAAGC TGGGCATCTT GCAGGCGnnn nnnnnnnnnn
701 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
751 nnnGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG CGGTTGACGA
801 TCACCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG GCGGGATTGC
851 TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA GATTCTCGGC
901 GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG CCCGTACGTT
951 GGCACTGCCG CACAGTGCGG ACGACGTGGC GGAAGCCGCG ATTGCGTGTG
1001 CGGCGTAA

This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>:

m087.pep

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ
51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVECVIGF
101 GGFVTFPGGL AAKLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF
151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILVVGGS LGADVLNKTV
201 PHALALLPDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX
251 XAGLGALLVP YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG
301 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA*
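
By way of illustration only, the correspondence between a nucleotide listing and its deduced amino acid listing can be checked by conceptual translation. The following Python sketch is not part of the analysis reported herein. It assumes the standard genetic code and reading frame 1 and, as an assumption made for this example, reports any codon containing an ambiguity code (such as n, w or k) as X. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1.

    Whitespace is ignored, case is ignored, trailing bases that do not
    complete a codon are dropped, and any codon not found in the table
    (for example one containing 'n') is reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First thirty bases of m087.seq as listed above.
print(translate("ATGGGCGGTA AAACCTTTAT GCTGAwkkCG"))
```

Applied to the first thirty bases of m087.seq, the sketch yields MGGKTFMLXX, in agreement with the first ten residues of m087.pep.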

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng)
from N. gonorrhoeae:

m087/g087

                 10        20        30        40        50        60
m087.pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI
         ||||||||  |||||||||||||||||| |||||||||||||||||||||||||||||||
g087     MGGKTFMLMAGGTGGHIFPALAVADSLRVRGHHVIWLGSKDSMEERIVPQYGIRLETLAI
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m087.pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV
         || |||||||||||| ||| ||||||||||||||||||||||||||||||||||| ||||
g087     KGIRGNGIKRKLMLPFTLYKTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV
                 70        80        90       100       110       120

                130       140       150       160       170       180
m087.pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g087     IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG
                130       140       150       160       170       180

                190       200       210       220      229
m087.pep RLKILVVGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA
         ||||||||||||||||||||| ||||||   || ||||||| ||| |||
g087     RLKILVVGGSLGADVLNKTVPQALALLPEEVRPQMYHQSGRNKLGNLQADYDALGVKAEC
                190       200       210       220       230       240

                                      230       240       250
m087.pep                                AGLGALLVPYPHAVDDHQTANARFMVQAE
                                        |||||||||||||||||||||||||||||
g087     VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE
                250       260       270       280       290       300

        260       270       280       290       300       310
m087.pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX
         |||||||||||||||||||| ||||||||||||||||||||||||||||||||||
g087     AGLLLPQTQLTAEKLAEILGSLNREKCLKWAENARTLALPHSADDVAEAAIACAAX
                310       320       330       340       350



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 281>:

a087.seq

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

401 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC

451 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

701 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC

751 TACCGCGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC

801 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG
851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG
901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA
951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087.a>: 

a087.pep

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVECVIGF

101 GGFVTFPGGL AAKLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILVVGGS LGADVLNKTV

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA

251 YRDADLVICR AGALTIAELT AAGLGALLVP YPHAVDDHQT ANARFMVQAE

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA

351 IACAA*



m087/a087 85.4% identity over a 355 aa overlap

                 10        20        30        40        50        60
m087.pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI
         ||||||||  ||||||||||||||||||||||||||||||||||||||||| | ||||||
a087     MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m087.pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV
         ||||||||||||||| |||||||||| |||||||||||||||||||||||||||| ||||
a087     KGVRGNGIKRKLMLPFTLYQTVREAQQIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV
                 70        80        90       100       110       120

                130       140       150       160       170       180
m087.pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a087     IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG
                130       140       150       160       170       180

                190       200       210       220       230       240
m087.pep RLKILVVGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQAXXXXXXXXXXX
         ||||||||||||||||||||| ||||||||||| ||||||||||| |||
a087     RLKILVVGGSLGADVLNKTVPQALALLPDNARPQMYHQSGRGKLGSLQADYDALGVQAEC
                190       200       210       220       230       240

                                    250       260       270       280
m087.pep XX                    XXXXXXXXXAGLGALLVPYPHAVDDHQTANARFMVQAE
                                        |||||||||||||||||||||||||||||
a087     VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE
                250       260       270       280       290       300

                290       300       310       320       330
m087.pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||
a087     AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX
                310       320       330       340       350

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 283>:

g088.seq

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT

51 TTTTCAATAC ACCACATTCC GCGCCGTTAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGC CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG
251 TGTCCACCCT GTTGTGGGGC AACTGGGCGA ACCCGTATAT CTGGATTCTC

301 TTGGGCGTAC TGCTTGCCAC CGGTGCGCTC GGTTTTTACG ACGACTGGCG
351 CAAAGTCGTT TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 
401 TGTGGCAGTC AAGCGTTGCC GTTatcgcCG GTttggcaTT GTTTTACctt 
451 gCcgcCAATT CCGCCAACAA TATTTTGATT GTCCCGtttT TCAAACAAAT 
501 CGCCCTGCCG CTGGGCGTGG TCGGCTTttt gGtgttgTCT TACCTGACCA 
551 TCGTCGGCAC ATCCAACGCC GTCAACCTCA CcgaCGGCTT GGACGGCCTT 
601 GCCGCcttcc cgttcgtcct cgttgccgcC GGGCTCGCCA ttttcgccTA 
651 CGTCAGCGGA CACTACCAAT TTTCCCAATA CCTCCAGCTT CCCTATGTCG 
701 CCGGCGCGAA CGAAGTCGCT ATATTCTGCA CCGCCATGTG CGGCGCGTGC 
751 CTCGGATTTT TGTGGTTCAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 
801 TGTCGGCGCG CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg
851 tCCGCCAAGA ATTTGTcctc gtcattaTGG GCGGTCTGTT cgtcgtagaa 
901 gccgtgTCCG TTATGCTTCa tgtcggCTGG TACAAGAAAA Ccaaaaaacg 
951 CATCTTcCTg acgGcaccga ttcatcacca ttaCCaactt cgatgCTGGa 

1001 aagaaacgca agtcgtcgtc CGTTtCTGGA TTAtTAccat cgtcgtggtt 
1051 tTgataggtt tGagtacccT caAAattcgc ggaaactatg ccgTCCGAAC 
1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088.ng>:

g088.pep

1 MFLWLAHFSN WLTGLNIFQY TTFRAVMAAL TALAFSLMFG PWTIRRLTAL

51 KCGQAVRTDG PQTHLVKNGT PTMGGSLILT AITVSTLLWG NWANPYIWIL

101 LGVLLATGAL GFYDDWRKVV YKDPNGVSAK FKMVWQSSVA VIAGLALFYL

151 AANSANNILI VPFFKQIALP LGVVGFLVLS YLTIVGTSNA VNLTDGLDGL

201 AAFPFVLVAA GLAIFAYVSG HYQFSQYLQL PYVAGANEVA IFCTAMCGAC

251 LGFLWFNAYP AQVFMGDVGA LALGAALGTV AVIVRQEFVL VIMGGLFVVE

301 AVSVMLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQVVV RFWIITIVVV

351 LIGLSTLKIR GNYAVRTPFR RHLNAQ*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 285>: 

m088.seq

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAnC TGGTTAACCG GTCTGAATnn

51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

151 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

251 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

451 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn

501 nnnnnnnnnn nnnGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC

751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTTATCG

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA

901 GCCGTATCCG TTATGCTTCA GGTTGGCTGG TATAAGAAAA CCAAAAAACG

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC

1101 ATCTTTCAGA CGGCATTTGA ACGCGCAATA A







This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m088.pep

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX

151 XXXXXXXXXX XXXXXXXXXX XGVVGFLVLS YLTIVGTSNA VNLTDGLDGL

201 ATFPVVLVAA GLAIFAYASG HSQFAQYLQL PYVAGANEVV IFCTAMCGAC

251 LGFLWFNAYP AQVFMGDVGA LALGAALGTV AVIVRQEFVL VIMGGLFVVE

301 AVSVMLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQVVV RFWIITIVLV

351 LIGLSTLKIR XTYAVXTSFR RHLNAQ*

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 088 shows 91.7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng)
from N. gonorrhoeae:

m088/g088

                                               10        20        30
m088.pep                               GVVGFLVLSYLTIVGTSNAVNLTDGLDGLA
                                       ||||||||||||||||||||||||||||||
g088     IAGLALFYLAANSANNILIVPFFKQIALPLGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA
               150       160       170       180       190       200

                 40        50        60        70        80        90
m088.pep TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA
          || |||||||||||| ||| || |||||||||||||| |||||||||||||||||||||
g088     AFPFVLVAAGLAIFAYVSGHYQFSQYLQLPYVAGANEVAIFCTAMCGACLGFLWFNAYPA
               210       220       230       240       250       260

                100       110       120       130       140       150
m088.pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM
         ||||||||||||||||||||||||||||||||||||||||||||| |||||||||||||
g088     QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLHVGWYKKTKKRIFLT
               270       280       290       300       310       320

                160       170       180       190       200
m088.pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX
         |||||||    |||||||||||||||| |||||||||||  ||| | ||||||||
g088     APIHHHYQLRCWKETQVVVRFWIITIVVVLIGLSTLKIRGNYAVRTPFRRHLNAQX
               330       340       350       360       370



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>:

a088.seq

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CGAACGGCGT GTCCGCCAAA TTCAAAATGG 

401 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT

451 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 

901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088.a>:

a088.pep

1 MFLWLAHFSN WLTGLNIFQY TTFRAVMAAL TALAFSLMFG PWTIRRLTAL

51 KCGQAVRTDG PQTHLVKNGT PTMGGSLILT AITVSTLLWG NWANPYIWIL

101 LGVLLATGAL GFYDDWRKVV YKDPNGVSAK FKMVWQSSVA IIAGLALFYL

151 AANSANNILI VPFFKQIALP LGVVGFLVLS YLTIVGTSNA VNLTDGLDGL

201 ATFPVVLVAA GLAIFAYASG HSQFAQYLQL PYVAGANEVV IFCTAMCGAC

251 LGFLWFNAYP AQVFMGDVGA LALGAALGTV AVIVRQEFVL VIMGGLFVVE

301 AVSVMLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQVVV RFWIITIVLV

351 LIGLSTLKIR *TYAV*TPFR RHLNAQ*

m088/a088 99.5% identity over a 205 aa overlap

               150       160       170       180       190       200
m088.pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA
                                       ||||||||||||||||||||||||||||||
a088     IAGLALFYLAANSANNILIVPFFKQIALPLGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA
               150       160       170       180       190       200

               210       220       230       240       250       260
m088.pep TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a088     TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA
               210       220       230       240       250       260

               270       280       290       300       310       320
m088.pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a088     QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM
               270       280       290       300       310       320

               330       340       350       360       370
m088.pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX
         |||||||||||||||||||||||||||||||||||||||||||||| ||||||||
a088     APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX
               330       340       350       360       370



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089.seq

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

251 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 

301 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC

351 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA

401 AAACCTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 
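
The listings herein are set out in rows of up to fifty residues, each row beginning with the position of its first residue. By way of illustration only, the following Python sketch joins such rows into a single sequence and checks each position label against the number of residues already read, which can help to reveal a transcription error in a label or a dropped block. The function name parse_listing and its error message are hypothetical and form no part of the disclosure.

```python
def parse_listing(lines):
    """Join numbered listing rows into one contiguous sequence string.

    Each non-blank row is assumed to start with the 1-based position of
    its first residue, followed by whitespace-separated residue blocks.
    A ValueError is raised if a label does not match the running count.
    """
    sequence = ""
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # blank separator line between rows
        label, blocks = int(tokens[0]), tokens[1:]
        expected = len(sequence) + 1
        if label != expected:
            raise ValueError(f"row labelled {label}, expected {expected}")
        sequence += "".join(blocks)
    return sequence


# First two rows of g089.seq as listed above.
rows = [
    "1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC",
    "",
    "51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC",
]
print(len(parse_listing(rows)))
```

For the two rows shown the sketch returns a sequence of 100 residues; a row labelled 53 where 51 is expected would instead raise an error.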

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089.pep 

1 MPPKITKSGF CKPAIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 291>:

m089.seq

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

201 AGCCACkGCA GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

301 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA ACCATTTTAC 

351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089.pep

1 MPPKITXSGF CKPAIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from N. gonorrhoeae: 

m089/g089

                 10        20        30        40        50        60
m089.pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD
         |||||| |||||||||||||||||||||| ||||||||||||||| |||||||||||||
g089     MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGKPWKVLTCSSNASRG
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m089.pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP
         || |||||||| |||||| || ||||| ||||||||  || |||||||||| || |||||
g089     KPTASHKATAAITLAALCKPCSGMSCVEIKSSLPCFKQPVPRSNQKSASCSKENRFTSRP
                 70        80        90       100       110       120

                130       140       150
m089.pep ARFIARQNASSAFKTCTPSPRKILALVCAX
         ||| |||| |||||||||||||| |||||
g089     ARFMARQNTSSAFKTCTPSPRKISALVCAX
                130       140       150
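
By way of illustration only, an identity figure of the kind quoted above can be approximated, for an overlap that contains no gaps, by comparing the two sequences position by position. The following Python sketch makes no assumption about the alignment program actually used; it simply counts identical residues over sequences of equal length, treats an undetermined residue (X) as a mismatch against any determined residue, and omits the terminal stop. The names percent_identity, M089 and G089 are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of identical positions between two ungapped sequences.

    Spaces used to group residues in blocks of ten are ignored. The two
    sequences must have the same length once spaces are removed.
    """
    a, b = a.replace(" ", ""), b.replace(" ", "")
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# m089.pep and g089.pep as listed above, without the terminal stop.
M089 = (
    "MPPKITXSGF CKPAIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV "
    "LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV "
    "SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA"
)
G089 = (
    "MPPKITKSGF CKPAIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV "
    "LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV "
    "PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA"
)

print(round(percent_identity(M089, G089), 1))  # 88.6
```

Under these assumptions the sketch finds 132 identical positions out of 149, which is consistent with the 88.6% identity reported for ORF 089 and ORF 089.ng.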



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>:

a089.seq

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA

401 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089.a>: 

a089.pep

1 MPPKITKSGF CKPAIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

m089/a089 91.9% identity over a 149 aa overlap

                 10        20        30        40        50        60
m089.pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD
         |||||| |||||||||||||||||||||| ||||||||||||||||| |||||||||||
a089     MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m089.pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP
         || |||||||| || ||| || | ||| ||||||||||||||||||||| ||||||||||
a089     KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP
                 70        80        90       100       110       120

                130       140       150
m089.pep ARFIARQNASSAFKTCTPSPRKILALVCAX
         |||||||||||||||||||||||||||||
a089     ARFIARQNASSAFKTCTPSPRKILALVCAX
                130       140       150

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>:

g090.seq

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA

51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATGAcgg 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG
251 GTTttgccgt tgctgCcggt aatcgCaatc accttgtcgC CGCGGCGGtt

301 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC .ccgccgTTT
351 TGCttga

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090.pep

1 MRVVEQIVVA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRGFAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 297>:

m090.seq 

1 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT .CCGCCGTTT
351 TGCTTGA

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m090.pep

1 MRIVEQVVVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA

51 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLVVPAV

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 118 aa overlap with a predicted ORF (ORF 090.ng)
from N. gonorrhoeae: 

m090/g090

                 10        20        30        40        50        60
m090.pep MRIVEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA
         || ||| ||||||||||| |||||| ||||||||||||| ||||||||||| | ||  |
g090     MRVVEQIVVAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT
                 10        20        30        40        50        60

                 70        80        90       100       110
m090.pep DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLVVPAVHNVRQQFDVAQHAXRRFAX
         ||||  ||||||| |||||| |||||||||||| ||  |||||||| ||| |||||||
g090     DIARNDGIQPALDAEIADQAGYRGFAVAAGNRNHLVAAAVHNVRQQLDVAXHAXRRFAX
                 70        80        90       100       110



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 299>: 

a090.seq

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 

a090.pep

1 MRVVEQVVVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA

51 LQFRLQNRRA DIARDNGIQP TLDAEIADQA RYRGFAVAAG NRNHLVAAAV

101 HNVRQQFDVA QHAXRRFA* 



m090/a090 91.5% identity over a 117 aa overlap

                 10        20        30        40        50        60
m090.pep MRIVEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA

The following partial DNA sequence was identified in N. gonorrhoeae

g090-1.seq This sequence contains multiple stop codons (not shown)

This corresponds to the amino acid sequence <ORF 090-1.ng>:

g090-1.pep (not shown)

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 2>:

m090-1.seq

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA AACGCTTCGA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC AAAGCTCGCG 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT GGGCATATGC 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA TTCAAGCCGC 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC AATCAAGAAC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG TGAAGTCCTG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT TGCAAACCGG 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA 

401 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC

451 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT GTGCAGACCT

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG 

701 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT 

751 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTTT 

1101 GCTTGAACGC CTCAATATCC GGCTGCCGCT CGCTGATGCC GGGACTGAGA 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC GGCCCGTGTA 

1201 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG CGTTCCGGCT 

1251 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT GCGCAGGTAG 

1301 GCAATCATGG AAATACCCGT ACCGCCGAGT CCGGCGACGA GGATTTTTTT

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090-1>:

m090-1.pep

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC

51 RLNGFSQSGA VGHIQAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA VVDKHHAVFL ADFFHAVRQA

151 LEGFDVFEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DNQIFIALFQ

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG

251 MRIVEQVVVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA

301 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLVVPAV

351 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRIFQTARV

401 KHQPVKHLTD LRHAFRLQLR IIRSNLRAVF AQVGNHGNTR TAESGDEDFF

451 VLKSHFGLS*

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>:

g091.seq

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG
251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC

301 TTGTTCTGA

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091.pep

1 MEIPVPPSPA TRIFLFESHF GFVLKQIILG RRRPPLPKPL SDGIASRLIT
51 RLQALVIVAA VLVSVLTSLA KPLLSERKVL AHAASIVIHQ AQIVLGLGIP
101 LF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 

m091.seq

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC. . . 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 091>:

m091.pep

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLGLGYP
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091.ng)
from N. gonorrhoeae:

m091/g091



10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA

IMIMIIIIIMMIh I lllllllhll lllllllllllll llllllllllll 

g091 MEIPVPPSPATRIFLFESHFGFVLKQIILGRRRPPLPKPLSDGIASRLITRLQALVIVAA

10 20 30 40 50 60 



70 80 90 100 

m091.pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR

I I M II I I II II : I : MIIMI: Ihllllllll II 
g091 VLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLFX

70 80 90 100 
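
For illustration, the percent identity quoted for an alignment of this kind can be recomputed from the two amino acid sequences. The Python sketch below is an assumption-laden simplification and not the program used for the computer analysis: the function name `percent_identity` is hypothetical, the comparison is ungapped and starts at residue 1 of both sequences, and only exact matches are counted (conservative substitutions are ignored).

```python
def percent_identity(a, b):
    """Percent of identical residues over the ungapped overlap of a and b."""
    overlap = min(len(a), len(b))
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / overlap

# m091.pep and g091.pep as listed above, without spaces or the terminal stop.
M091 = ("MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSIT"
        "RLQALVIVAAVLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR")
G091 = ("MEIPVPPSPATRIFLFESHFGFVLKQIILGRRRPPLPKPLSDGIASRLIT"
        "RLQALVIVAAVLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLF")

# Restrict both sequences to the 101 aa overlap reported for ORF 091.
print(round(percent_identity(M091[:101], G091[:101]), 1))  # 84.2
```

Under these assumptions 85 of the 101 overlapping residues are identical, which agrees with the 84.2% figure given above.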



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 307>:

a091.seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
This corresponds to the amino acid sequence <SEQ ID 308; ORF 091.a>:

a091.pep 

1 MEIPVPPSPA TRIFLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFQALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLG

m091/a091 93.8% identity over a 96 aa overlap

10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 
f M ! I I I I I I I II I | | I M I I I I I I M I I | | f M I I M I I f I I | | | : M I [ II M 
a091 MEIPVPPSPATRIFLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCSITRFQALVIVAA 

10 20 30 40 50 60 

70 80 90 100 

m091.pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR

I I M I I I I I I I I I I II 1 I I I I I I If | | | M I I ! I I I 
a091 VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLG

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>:

g092.seq

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

301 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC

351 CGAAGTtgtC gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

401 TGATGCTGGC AGAGCTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

1301 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactcCCG CGCCTTGGCG

1351 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt

1401 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg
1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg
1501 gaattgtcga AACAGAtttg A 
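
The sequences in this listing are printed in rows of up to 50 residues, each row being split into groups of 10 and preceded by the position of its first residue. As an illustrative aid only, the Python sketch below joins such rows back into one contiguous string. The function name `join_listing_rows` is hypothetical. The sketch assumes that every letter in a row belongs to the sequence, so stray letters are not detected, and it converts everything to upper case, which discards the upper/lower case distinction used in the listing.

```python
import re

def join_listing_rows(rows):
    """Join numbered, space-grouped listing rows into one uppercase string.

    Hypothetical helper: digits, whitespace and punctuation are dropped and
    every remaining letter is assumed to be part of the sequence.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", row) for row in rows).upper()

# Final row of g092.seq as printed above.
print(join_listing_rows(["1501 gaattgtcga AACAGAtttg A"]))  # GAATTGTCGAAACAGATTTGA
```

Because only letters are kept, a position number that has been split by scanning (for example `13 01`) does not affect the result.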

This corresponds to the amino acid sequence <SEQ ID 310; ORF 092.ng>:

g092.pep

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NGADVVVAST AVKKENPEVV AALERQIPVI PRALMLAELM RFRDGIAIAG

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSVVTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK
251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV

301 QMKGHEQGSF EVVLNMPGRH NVLNALAAIG VALEVGASVE AIQKGLLGFE

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYPEKRLV
401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PQMLMNVLQD GDVVLNMGAG SINRVPSALL
501 ELSKQI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 311>:

m092.seq

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTTA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGCATCGCC GAAGTCTTGC ACAATTTGGG

201 CTTTAAAGTT TCCGGTTCGG ATCAgGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC
351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC
401 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC
501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG

551 GCACTAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TGCCGCCGCA CGCGGCGCGT ATCTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA
1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 
1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG CGCTCTTGCC 

1351 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT

1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG
1451 TGTTGAATAT GGGCGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG
1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 312; ORF 092>: 

m092.pep

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVVVTST AVKKENPEVV AALEQQIPVI PRALMLAELM RFRDGIAIAG

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD

201 ESDASFLHLT PIMSVVTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EVVLNMPGRH NVLNALAAIG VALEVGASVE AIQKGLLGFE

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLEKRLV

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA
451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL
501 ALSKQI* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from N. gonorrhoeae: 

m092/g092 

10 20 30 40 50 60 

m092.pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA
MMIIIMIIIIII INI MIIMI H MllllllllhlllllllMMMMI

g092 MFFISIRYIFVRKLWCANGQTFKITPLRTKNQPERNIMMKNRVSNIHFVGIGGVGMSGIA

10 20 30 40 50 60 

70 80 90 100 110 120 

m092.pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEVV

i M ! I M 1 11 I M 1 1 1 I 1 1 1 1 1 f ] ^ I ! 1 1 1 1 1 1 1 I I M 1 1 M 1 1 1 1 1 ^ I ! 1 1 I 1 1 1 1 1 1 1 

g092 EVLHNLGFKVSGSDQARNAATEHLSSLGIQVYPGHTAEHVNGADVVVASTAVKKENPEVV

70 80 90 100 110 120 

130 140 150 160 170 180 

m092.pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL

MMMIMMMMMIMMMMMMMIIIMMMIMMMIMMMMMI 

g092 AALERQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL

130 140 150 160 170 180 

190 200 210 220 230 240 

m092.pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID

M ! M 1 1 1 M 1 1 1 1 I I M I M 1 1 E 1 M 1 1 1 1 1! M 1 1 M 1 1 M M i 1 1 1 IE 1 1 1 ! I I M I 

g092 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID

190 200 210 220 230 240 

250 260 270 280 290 300 

m092.pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV

M I 1 1 1 1 f 1 1 1 I M = ! I I M I M ! ! I i f E E M I ! If 1 1 1 1 1 1 1 1 I M I I 1 1 I ] I M i I 1 1 

g092 FIHRMPFYGKAFLCVDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV

250 260 270 280 290 300 

310 320 330 340 350 360 

m092.pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG

IMIIIIIMMIIIIilllllllllMIIIIIMIIMIIIIIMIIIIIIIIIMIII 

g092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG

310 320 330 340 350 360 

370 380 390 400 410 420 

m092.pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK

IIIMMIMIMIIIIIIIIIIIIIIIIIMII IIIII1IMIIIIIIIIIIMMII 

g092 DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK

370 380 390 400 410 420 

430 440 450 460 470 480 

m092.pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD

IMIIIMIIIIMIIIMhIIIIMIIIIIIMMIIIIIIMIIIIIhlhlllll 

g092 VLNTVDALVLTEVYAAGEEPVAAADSRALARAIRVLGKLEPIYCENVADLPQMLMNVLQD

430 440 450 460 470 480 

490 500 
m092.pep GDIVLNMGAGSINRVPAALLALSKQIX

Ihllllllllllllhlll IIMM 

g092 GDVVLNMGAGSINRVPSALLELSKQIX

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 313>:

a092.seq

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC

351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

401 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC

501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGT GTTGAGAAGC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA

751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA

1051 GGTGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 TGGAACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TTCCGCCGCA CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 

1351 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG

1451 TGTTGAATAT GGGTGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG

1501 GAATTGTCGA AACAGATTTG A 



This corresponds to the amino acid sequence <SEQ ID 314; ORF 092.a>:

a092.pep

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVVVTST AVKKENPEVV AALEQQIPVI PRALMLAELM RFRDGIAIAG

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSVVTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EVVLNMPGRH NVLNALAAIG VALEVGASVE AIQKGLLGFE

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ELSKQI* 



m092/a092 99.4% identity over a 506 aa overlap 

10 20 30 40 50 60 

m092.pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA
I M I I I I I I I I I I I I I I M I I I I I I I I I I I I M | i I M I I I | j I i ft | | t | | | | | | | ! | | 
a092 MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA

10 20 30 40 50 60 

70 80 90 100 110 120 

m092.pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEVV
I 1 I I I I ! I I I I I I I I E I I ! I I I I I I I I I I I I ! I I I I I I II I I i I 1 I M i I I 1 I I I I I t M 
a092 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEVV

70 80 90 100 110 120 

130 140 150 160 170 180 

m092.pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL

I f I I M I I I I I I I I I I I I I I I It I I I ( I I | | | | | | j | | | | | | | M M I I I I II I I I I I I | 
a092 AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m092.pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID

M M I I I I I I I I I I I M I I I M I t I I I I I M I I I I I I I I I t I | | | 1 I it 1 I I I I I I I I I I 
a092 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID
190 200 210 220 230 240






250 260 270 280 290 300 

m092.pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV
M 1 1 1 1 1 M 1 I I ' 1 M M I M f M I I M I I M I [ I I M M | | ] | 1 | j ] | | M | M , , , M 

a092 FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV
250 260 270 280 290 300 

310 320 330 340 350 360 

m092.pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG

I 11 1 1 ' 1 1 1 I ' I ' ' I I I f I I I I II I II I I M I M I I } I ] 1 i | J M | M II I it II I I I I I 

a092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG
310 320 330 340 350 360

370 380 390 400 410 420 

m092.pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK

II HIMMIMIIIIIIIIMIIIM:M|III I I I I I I I M I I I I I I I I ! I I f I I f I 
a092 DIKLPNGGTALLVDDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK

370 380 390 400 410 420 

430 440 450 460 470 480 

m092.pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD
M M 1 M M 1 1 1 M I I I M I I I I I I I | | | M I M I I I I M I M I M I I I M II II I I I II 
a092 VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD

430 440 450 460 470 480 

490 500 
m092.pep GDIVLNMGAGSINRVPAALLALSKQIX

I I M II I I I II | | N | | M I II I I | | 
a092 GDIVLNMGAGSINRVPAALLELSKQIX
490 500 






The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 315>:

g093.seq

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA

51 ACGAGAaatc tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

351 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

401 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCgt cggcgtggta 

451 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 

501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 

601 CGCAACCGAG TTTTACGAct acgaagccaa GtacaaCCGA GACGAcacca 

651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093.ng>:

g093.pep

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV

151 KVKEKGRLKS VYEELKHLQG RNHCRTFYRR RRIFLPRPER QRAARHTHHP 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RTGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 317>:

m093.seq

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC
351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC

401 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA
451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA
501 CCTTCAGGG. CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 
601 CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC GACGACACCA 
651 TTTATCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
801 TCAACACCCT GCCCGGTATG ACGAGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 318; ORF 093>: 

m093.pep

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS

51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093.ng)
from N. gonorrhoeae: 

m093/g093 

10 20 30 40 50 60 

m093.pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA

I M II I i MM i ; Ml M M I M I M M MM I MM MM II I III II I Ml MMIl 

g093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKERGFQTA
10 20 30 40 50 60 

70 80 90 100 110 120

m093.pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH

MIMIMI I 1 1 1 1 I 1 1 1 I I I 1 1 I I I I M I I I 1 1 1 1 II I I 1 1 I I I I I I 1 1 I I i I I I I ] : 

g093 FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLY
70 80 90 100 110 120 

130 140 150 160 170 180 

m093.pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKGKGRLKSVYEELKHLQXRNHCRTFYRR

I MM I M M M MM M II I II I I M MUM 1 1 1 1 1 1 I M I 1 1 It I 1 1 I M 1 1 I [ I 

g093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKEKGRLKSVYEELKHLQGRNHCRTFYRR
130 140 150 160 170 180 

190 200 210 220 230 240 

m093.pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR

I M M M I 1 1 1 M 1 1 1 !M 1 1 f E i I I I I ) M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 ! I i 1 

g093 RRIFLPRPERQRAARHTHHPRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR
190 200 210 220 230 240 

250 260 270 

m093.pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX

I MM Mill I Mill III II Mill M MM MM I 

g093 RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX
250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 319>:

a093.seq

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

401 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA

451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA

501 CTTTCAGGG. CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G

This corresponds to the amino acid sequence <SEQ ID 320; ORF 093.a>:

a093.pep

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRIFLPCVER QRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

m093/a093 95.7% identity over a 276 aa overlap 

                 10        20        30        40        50        60
m093.pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a093     MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m093.pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH
         ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
a093     FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH
                 70        80        90       100       110       120

                130       140       150       160       170       180
m093.pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKGKGRLKSVYEELKHLQXRNHCRTFYRR
         ||||||||||||||||||||||||||||||||||||||||||||||| |||||||| |||
a093     DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKGKGRLKSVYEELKHFQXRNHCRTVYRR
                130       140       150       160       170       180

                190       200       210       220       230       240
m093.pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR
         ||||||  |||| |||||| | ||| ||||||| |||||||||| |||||||||||||||
a093     RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR
                190       200       210       220       230       240

                250       260       270
m093.pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX
         |||||||||||||||||||||||||||||||||| ||
a093     RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX
                250       260       270
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 321>:

g094.seq

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094.pep

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMAM
51 PSRKRISSAS IKARGITGIC RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

m094.seq

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

301 TGGCCGGGAT AA

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

m094.pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMAM
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng)
from N. gonorrhoeae: 

m094/g094 



10 20 30 40 50 60 

m094.pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN

Ml II I; I I I M I I ! M I I I II ! M I M I I I I I I I I I I I I : I ! I I . I I M I I I I I :M: 

g094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRISSAS

10 20 30 40 50 60 



70 80 90 100 

m094.pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX

hilllllll i I I ; I M 1 1 ! I : ! M ^ I 1 1 1 I M 1 1 ! I : 1 1 1 1 

g094 IKARGITGICRSNAATTSGFSFLTAVEATTTSAPLTCSAVWPGX

70 80 90 100 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 325>: 

a094.seq

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 326; ORF 094.a>: 






a094.pep

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMAM
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

m094/a094 100.0% identity over a 103 aa overlap 

10 20 30 40 50 60 

m094.pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN
M M I I I M M I t M I I I I II II ! I! M t I I I I I i I I I I II I I I II M I I I M 1 If t M ! 
a094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN

10 20 30 40 50 60 

70 80 90 100 

m094.pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX
I I I I I I I I I I II M I I ! II I I I I II I I I I I Ml I I I I I I II I If 
a094 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 327>: 

g095.seq

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGGGTCA GTGTAGGAAA 

301 GAGGCATCGG ATCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG

351 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095.ng>: 

g095.pep

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV

51 NTQKGFAVEG HTVDEIDKRL MQFFDAVPVG IHMVFVDIGN DGHNRGQCRK

101 EASDRRLRQR CIRLCPSGRW CLRR*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 329>:

m095.seq

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTAG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>:

m095.pep

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV

51 NTQKGFAVEG HTVDEIDKRL MQFFDAVPVG IHMVFVDIGN DGHNRCQCRK

101 DASDRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095.ng)
from N. gonorrhoeae: 

m095/g095 

10 20 30 40 50 60 

m095.pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG

IIIIMIMIIMIIMIIIIIMIIIIIIIMMMIIIIMIIIIIIIIIIIIIIIM 
g095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG

10 20 30 40 50 60 

70 80 90 100 110 120 

m095.pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX

I I I Ml I ! I MINI M II I II I I I I Ml IIMM I I I MM II I I II II Ml I MM 

g095 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRQRCIRLCPSGRW

70 80 90 100 110 120 



m095.pep CLRRX

Mill 

g095 CLRRX



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 331>:

a095.seq

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 



This corresponds to the amino acid sequence <SEQ ID 332; ORF 095.a>:

a095.pep

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV

51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK

101 DASDRRLRQR CIRLCPSGRW CLRR*



m095/a095 96.0% identity in 124 aa overlap



10 20 30 40 50 60 

m095.pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG

II 1 I I M I I M I M I I I I I I I M I M I M I I I I I M I I I M I I I I ! M I M 1 I I I I I I i I 
a095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 






70 80 90 100 110 120 

m095.pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX

M I I I I N I II I : : : : I I I I I I II I I t I I I I I I I I I I I I! II It M N II I I I M M M 
a095 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m095.pep CLRRX
I I I I I 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 333>:

g096.seq

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCCrrr

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC SSSSS 

301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG
351 CTTTTCAGAc ggccttTTTC TTTGA

This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>:

g096.pep

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF
51 GQIFRRTLIN GVVSVVLGFV VVKLGCGDDV YAGQPFAVQD GAGIFAAADK
101 TFGNDFAPEG VSILRKRFSD GLFL*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 335>: 

m096.seq

1 ATGGCTCGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC
51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT
101 r™*^ CGCGAACCGC CAGTTCGCGC ATCAGGCtS SSgGC^C
151 S^aaatct tccgaaggac attgataaat ggtgtcgtcg cggttgtact
201 TGGCTTCGTA GTCGTAAAAC TCGGTTGCGG GAATGATGTG TATGCCGGGC
251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGa52£
301 ACGTTCGGCA ATGATTTCGC CC.TGAAGGT GTTTCAATTC TTCGTAAACG
351 CTTTTCAGAC GGCCTTTTCC TTTGA

This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>:

m096.pep

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF
51 GQIFRRTLIN GVVAVVLGFV VVKLGCGNDV YAGQPFAVQD GAGIFAAADK
101 TFGNDFAXEG VSILRKRFSD GLFL*



m096/g096 96.0% identity in 124 aa overlap 

10 20 30 40 rrv 
m096.pep MARHTGQGVDFQQIEFAVGIFEETnAWnnTrPTr^T D.^n^.^ ™„ 60 



g096 



JU 40 50 £n 

T?T???TM??MMT?fM EIDM ^ FRTDCL ^ RQFAHQAFFGFG Q i ^^LiS 

MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRT^ 

10 20 30 40 50 60 



n>0 9 6.pep ^VAWLGFVy^ 

5096 GWSWLGFVWKLGCGDDVYAGQPFAVQDGAGIFAAADKTFGNDFAPEGVS^ 

70 80 90 100 no 120 



m096.pep  GLFLX
          |||||
g096      GLFLX



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 337>:

a096.seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CgSS2S£ 

tit rr7, GCGCG ° CGCGAACC ^ CAGTTCGCGC ATCAGGCTTT ctSSSSc 

lm t™^ TCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTaS 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC cScCGATAA* 

301 CCGTTCGGCA ATGATTTCGC CCT . GAAAGT GT^SStC SSSSS 

351 CTTTTCAGAC GGCCTTTTCC TTTGA TTCGTAAACG 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096.a>:

a096.pep

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF
51 GQIFRRTLIN GVVAVVLGFV VIKLGRGDDV YAGQAFAVQH RAGIFAAADK
101 PFGNDFAXES VSILRKRFSD GLFL*



m096/a096 92.7% identity in 124 aa overlap 

10 20 30 40 50 60 

m096.pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN

II I M I I t M II I I I I I I t I I I I I I I I M I I I I I I I ! I I I M I 11 I I I I I I I I I I M I ! 
a096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN

10 20 30 40 50 60 



70 80 90 100 110 120 

m096.pep GVVAVVLGFVVVKLGCGNDVYAGQPFAVQDGAGIFAAADKTFGNDFAXEGVSILRKRFSD
I M I I I I I I I I : I M I : I I I I I I I I I I 1 I I I I I I I I I I I I I I M : I I I I I I I I I I 
a096 GVVAVVLGFVVIKLGRGDDVYAGQAFAVQHRAGIFAAADKPFGNDFAXESVSILRKRFSD

70 80 90 100 110 120 

m096.pep  GLFLX
          |||||
a096      GLFLX



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>:

g097.seq

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT
351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC
401 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG
451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC
501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 
551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 
701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 
801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

1201 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT
1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT
1301 ATTTGGGCTG A

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097.pep

1 MDISKQTLLD RVFNLKANGT TVRTELMAGL TTFLTMCYIV IVNPLILGET
51 GMDMGAVFVA TCIASAIGCF VMGFIGNYPI ALAPGMGLNA YFTFAVVKGM
101 GVPWQVALGA VFISGLIFIL FSFFKVREML VNALPMGLKM SIAAGIGLFL
151 ALISLKGAGI IVANPATLVG LGDIHQPSAL LALFGFVMVV VLGYFRVQGA
201 IIITILTITV IASLMGLNEF HGVVGEVPGI APTFMQMDFK GLFTVSMVSV
251 IFVFFLVDLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA
301 LGTSSTTPYV ESAAGVSAGG RTGLTAVTVG VLMLACLMFS PLAKSVPVFA
351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG
401 FISYAVVKLL CRRTGDVPPM VWVVAVLWAL KFWYLG*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 341>:

m097.seq

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACk ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCyGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC
251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG
301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT
351 TTTTATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC
401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG
451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC
501 CTTGGTCGGT TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCATTGT
551 TCGGTTTTGC TATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATCTTGAC CATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAATGAATTT CACGGCATCA TCGGCGAAGT ACCGAGCATT GCGCCGACTT 
701 TTATGCAGAT GGATTTTGAA GGCCTGTTTA CCGTCAGCAT GGTCAGTGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 
801 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCTGCG TTCCTGACCA 

1151 TTGTTTTCAT GCCGTTTACT TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT
1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT
1301 ATTTGGGCTG A

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>:

m097.pep

1 MDTSKQTLLD GIFKLKANGT TVRTELMAGL TTFLTMCYIV IVNPXILGET
51 GMDMGAVFVA TCIASAIGCF VMGFVGNYPI ALAPGMGLNA YFTFAVVKGM
101 GVPWQVALGA VFISGLIFIL FSFFKVREML VNALPMGLKM SIAAGIGLFL
151 ALISLKGAGI IVANPATLVG LGDIHQPSAL LALFGFAMVV VLGHFRVQGA
201 IIITILTITV IASLMGLNEF HGIIGEVPSI APTFMQMDFE GLFTVSMVSV
251 IFVFFLVDLF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA
301 LGTSSTTPYV ESAAGVSAGG RTGLTAVTVG VLMLACLMFS PLAKSVPAFA
351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG
401 FISYAVVKLL CRRTKDVPPM VWIVAVLWAL KFWYLG*

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng)
from N. gonorrhoeae:

m097/g097 



m097 .pep 
g097 

m097 .pep 
g097 



10 20 30 40 50 60 

I M )T f ^?T^?^ 1 F ^KANGTTVRTELMAGLTTFLTMC YI VI VNPX I LGETGMDMGAVFVA 

11 "Mill HHIMIMIIIIIIIIMIIMMIIMIM 1 1 1 1 1 1 1 1 1 nm 

MDI SKQTLLDRVFNLKANGTTVRTELMAGLTTFLTMCYI VI"TO^ 

10 20 30 40 50 60 

70 80 90 100 HO 120 

NNIIIIIIIII|:|||||||||MMIIIIMIMIIIIIIIIIIMMIMIMiM 

TCIASAIGCFWGFIGNYPIALAPGMG^^ 

70 80 50 100 no 120 



130 140 150 160 



170 180 
m097.pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL

I I i I I I I I I Ml I M I II M II I II I I I MINI I M I II II II I I i I II I II I I I II M 

g097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m097.pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE

I II I I M I MM Ml I MM I MM II I IMM II M I IMMMI M Mill MM I M 

g097 LALFGFVMVVVLGYFRVQGAIIITILTITVIASLMGLNEFHGVVGEVPGIAPTFMQMDFK

190 200 210 220 230 240 



250 260 270 280 290 300 

m097.pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA

I 1 1 1 I ! M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M H ! 1 1 1 1 I M f I 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 f M 

g097 GLFTVSMVSVIFVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA

250 260 270 280 290 300 



310 320 330 340 350 360 

m097.pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT

I i I i I I I I I II I I I I I I ! I I I I E I I I 1 I I I I I I I I I I ! I I I I I I I I I : I I I | I | | | | | | I 
g097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPVFATAPALLYVGT

310 320 330 340 350 360 



370 380 390 400 410 420 

m097.pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAVVKLLCRRTKDVPPM

MM II Mllli : MIIIMM IMM hi IIMMI Ml Ml Ml MMMM Mill 

g097 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAVVKLLCRRTGDVPPM

370 380 390 400 410 420 



430 

m097.pep  VWIVAVLWALKFWYLGX
          ||:||||||||||||||
g097      VWVVAVLWALKFWYLGX

430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 343>:

a097.seq

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 ATCATCATCA CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG

751 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT
801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 
951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
1301 ATTTGGGCTG A

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097.a>: 
a097.pep

1 MDTSKQTLLD GIFKLKANGT TVRTELMAGL TTFLTMCYIV IVNPLILGET
51 GMDMGAVFVA TCIASAIGCF VMGFVGNYPI ALAPGMGLNA YFTFAVVKGM
101 GVPWQVALGA VFISGLIFIL FSFFKVREML VNALPMGLKM SIAAGIGLFL
151 ALISLKGAGI IVANPATLVG LGDIHQPSAL LALFGFAMVV VLGHFRVQGA
201 IIITILTITV IASLMGLNEF HGIIGEVPSI APTFMQMDFK GLFTVSMVSV
251 IFVFFLVDLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA
301 LGTSSTTPYV ESAAGVSAGG RTGLTAVTVG VLMLACLMFS PLAKSVPAFA
351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG
401 FISYAVVKLL CRRTKDVPPM VWIVAVLWAL KFWYLG*

m097/a097 99.3% identity in 436 aa overlap 



a097 

m097 .pep 
a097 



10 20 30 40 50 60 

m097.pep MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA
IM,IIMlt!, HMMllf||||MMMMII!M||||M I I I I ! I I II I i I I I I 
MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAV 

10 20 30 40 50 60 

70 80 90 100 no 120 

TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 

TCIASAIGCFVMGFVGNYPIAIAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 
70 80 90 100 HO 120 

130 140 150 160 170 ifin 

m097.pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL
H I j I I M II I I I I II I I M I M M I I I! I I I | M I I I M I M I I I I M I j | | I I I I I I i 
a097 FSFFKVREMLVNALPMGLPCMSIAAGIGLFLALISLKGAGIIVANPATLVGL^ 

130 140 150 160 170 180 

190 200 210 220 230 7An 

m097.pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE

1 1 1 1 1 1 1 1 M 1 1 " 11 1» N I I I i I I I i I I M I M i M I II I II ! I | | | I? M M MM 
a097 LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSI^ 

190 200 210 220 230 240 

250 260 270 280 290 300 

GL^VS^SVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKP^LADSTAIVAGAA 



m097 .pep 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 M f 1 1 1 T 7VT7 V7T™iTTTTT 1 1 n , ; , 

a097 GLFTVSMVSVIFVFFLVDLFDSTGTLVGVSHRAGLLVDGKL^ 

250 260 270 280 290 300 

310 320 330 340 350 

m097.pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT
! MMI,,lll,l ""IMMMMMMMMIMI!l!MIMMMiMIMMII 
LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
310 320 3 30 340 350 360 



a097 



a097 



370 380 390 



400 410 420 



m097.pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAVVKLLCRRTKDVPPM

N I N I I I M I M I I I I I I M I I I ] l 1 1 1 II I M I II I I I I I I I I I I 1 1 1 1 1 ,, I I , 1 , 
QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRT^ 

370 380 390 400 410 420 



430 

m097.pep  VWIVAVLWALKFWYLGX
          |||||||||||||||||
a097      VWIVAVLWALKFWYLGX

430 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 345>:

g098.seq

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC

101 AGTTTGTCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC
251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT
301 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC

351 ATTTTTGTGC CAAATTAGAA TGTCGTAA

This corresponds to the amino acid sequence <SEQ ID 346; ORF 098.ng>:

g098.pep 

1 MTADGLFVAF NFNTFAVVRI LIPVQQDAAQ AGDQFVGDVA RFAVGMAFAF

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 

m098.seq

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098.pep

1 MTADGLFVAF NLNAFAVVRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 
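
The correspondence between a listed nucleotide sequence and its amino acid sequence can be checked by translating the reading frame with the standard genetic code. The following is a minimal sketch under that assumption; the helper names `clean_sequence` and `translate` are illustrative and are not part of the original disclosure. It strips the position numbers and spacing used in the listings and translates the first and last lines of m098.seq.

```python
# Illustrative sketch only: helper names are hypothetical, and the standard
# genetic code is assumed for translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code: codon -> one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(listing: str) -> str:
    """Remove position numbers and whitespace from a listing; upper-case the bases."""
    return "".join(ch for ch in listing if ch.isalpha()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 up to and including the first stop codon.

    Codons containing ambiguous or unreadable bases are reported as 'X'.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


# First 30 bases and last 18 bases of m098.seq as listed above.
m098_start = clean_sequence("1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC")
m098_end = clean_sequence("CAAATCAGAA TGTCGTAA")

print(translate(m098_start))  # MTADGLFVAF
print(translate(m098_end))    # QIRMS*
```

The two printed fragments agree with the first ten residues and the last five residues (plus stop) of m098.pep.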

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

MTADGLFVAFNLNAFAWRI LI PVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 
I III I M II I h i HI I M I III I hll h II I! i MIM I h 1 I i : [ M||:||:| 
MTADGLFVAFNFNTFAWRI LI PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAMGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

MhMMMMMMMMIMMIIIIMMMMIMMMIMMI MINIMI 

GTHHVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVCDFFKLAFLC 
70 80 90 100 110 120 



m098 -pep 
g098 

m098 .pep 
g098 



m098.pep QIRMSX 

MIMI 

g098 QIRMSX 
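
The percent identity quoted for each alignment is the fraction of aligned positions at which both sequences carry the same residue; 89.6% over 125 positions corresponds to 112 identical residues. The following is a minimal sketch for ungapped, equal-length alignments such as the one above. The function name `percent_identity` is illustrative, and this is not the alignment program used to produce the original results.

```python
# Illustrative sketch only: assumes the two sequences are already aligned
# without gaps and have equal length.
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Return the percentage of positions with identical residues."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    if not seq_a:
        raise ValueError("empty alignment")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# First 16 residues of m098.pep and g098.pep (they differ at positions 12 and 14).
m098_n_term = "MTADGLFVAFNLNAFA"
g098_n_term = "MTADGLFVAFNFNTFA"

print(round(percent_identity(m098_n_term, g098_n_term), 1))  # 87.5
print(round(100.0 * 112 / 125, 1))                            # 89.6
```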

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 

a098.seq

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 
101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098.a>:
a098.pep

1 MTADGLFVAF NLNAFAVVRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF
51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF
101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

m098/a098 100.0% identity in 125 aa overlap 

                  10        20        30        40        50        60
m098.pep  MTADGLFVAFNLNAFAVVRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a098      MTADGLFVAFNLNAFAVVRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m098.pep  GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a098      GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC
                  70        80        90       100       110       120

m098.pep  QIRMSX
          ||||||
a098      QIRMSX



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 351>:
g099.seq

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTGGA

51 GCTGACGGGC AAACGGCAGG CGGGCATTAC TGCCACAGAC ATCGTGTTGG

101 CACTGACCGA ATTCTTGCGT AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 

151 TTTTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCCATG TTCGCCATCG 

251 ACGCGCAAAC TATTGATTAT TTGAAACTGA CCGGACGTGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCCACC 

451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 

501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCGTGTA

551 CCAATACTTC CAACCCGCGC AACGTTGTCG CCGCCGCACT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 

701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCATGTACC 



751 ACCTGTAACG GCATGAgcgG CGCGCTcgaC CCGAAAATCC AACAAGAAAT
801 CATCGACCGC GAtttgtacg cCACCGCCGT ATTGTCAGGC AACCGCAACT
851 TCGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT
901 CCTTTGGTCG TTGCCTACGC ATTGGCAGGT AGCATCCGTT TCGATATTGA
951 AAACGACGTA CTCGGCGTTG CAGACGGCCG CGAAATCCGC CTGAAAGATA
1001 TCTGGCCGAC AGACGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA
1051 CCGCAACAAT TCCGCGACAT TTATATCCCG ATGTCCGACA CCGGCACAGC
1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGACCGATG TCCACCTACA
1151 TCCGCCGTCC GCCCTATTGG GAAGGCGCAC TGGCAGGGGA ACGTACATTA
1201 AGAGGTATGC GTCCGCCGGC GATTTTGCCC GACAACATCA CCACCGACCA
1251 CATCTCgcca tCCAATGCGA TTTTGGCCGG cagTGCcgca ggtgaATATT






1301 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC
1351 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT
1401 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt
1451 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC
1651 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA
1801 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT
1901 TGGAAGGGAA CGCGGCTTAG

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>:

g099.pep

1 MLGRASMMRL PDIVGVELTG KRQAGITATD IVLALTEFLR KERVVGAFVE
51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ
101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT
151 ADLAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR
201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL GFGIVAFACT
251 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP
301 PLVVAYALAG SIRFDIENDV LGVADGREIR LKDIWPTDEE IDAIVAEYVK
351 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL
401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFNSYATH
451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI
501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIA AEGFERIHRT
551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD VVGERTPRCG LTLVIHRKNG
601 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>:

m099.seq

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA
51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG
101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA
151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT
201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG
251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG
301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC
351 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG
401 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC
451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA
501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA
551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC
601 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT
651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC
751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT
801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT
851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT
901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA
951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA
1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA
1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC
1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA
1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA
1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA
1251 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT
1301 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC
1351 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT
1401 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT
1451 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG

1601 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 

1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 
^ CGCACACC GCGCTGCGAC CTGACCCTCG TGATTCACCG TAAAAACGGC 

1801 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 

1851 ATTGGTATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>:

m099.pep

1 MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERVVGAFVE
51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ
101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT
151 ADLAAKGLAK PYEEPSDGQM PDGSVIIAAI TSCTNTSNPR NVVAAALLAR
201 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT
251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP
301 PLVVAYALAG SIRFDIENDV LGVADGKEIR LKDIWPADEE IDAVVAEYVK
351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL
401 RGMRPLAILP DNITTDHLSP SNAILAVSAA GEYLAKMGLP EEDFNSYATH
451 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI
501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT
551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD VVGERTPRCD LTLVIHRKNG
601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA*

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae 

ORF 099 shows identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) from N. gonorrhoeae:

m099/g099 



10 20 30 



40 50 60 



m099.pep MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERVVGAFVEFFGEGARSLS

g099 MLGRASMMRLPDIVGVELTGKRQAGITATDIVLALTEFLRKERVVGAFVEFFGEGARSLS

10 20 3° 40 50 60 

70 80 90 100 no i2o 

MIMMIIMIIIMMIII IMIIMIIIIIllMMM 



m099 .pep 
g099 

m099.pep 
g099 

m099.pep 
g099 



IGDRATISNMTPEFGATAAMFAIDAQTIDYLKLTGRDDAQVKLVETYAKTAGLWAGGLKT 
70 80 90 ioo 



110 120 



130 140 150 160 170 ISO 

N 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M I M IIIIIIIIMIIIII INN Ill-Mi mi 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGIiAKPYEEPSDGQMP 

130 140 150 160 170 180 

190 200 210 220 230 240 

IT ^ TTT?TTTTT^M^'^^^ i '^l'^^''f ^'^'^'^'^^ FAPGSKVAE I YLKE AGLLPEME KL 
J, 1 I ' I I I I I I I I I I I I I I | I | | | | | | | | | | I I I | I | I | I | | I I I I III 

190 200 210 220 230 240 

m099.pep T^T^^TT n'^'i'f T^^T^f Qkei idr^lyatavlsgnrnfdgrihpyakqafias p 

,099 

° 260 270 280 290 300 

310 320 3 30 340 350 360 
PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAVVAEYVKPQQFRDVYVP

IMMIIMI II I IMMMIMM MIIMM IIMMIM Ml I Ml I IMIMMM 

PLWAYAIiAGSIRFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPQQFRDIYIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

MFDTGTAQKAPS PLYDWRPMST YI RRPPYWEGALAGERTLRGMRPLAI LPDN I TTDHLS P 

I M I I i I I I I i I 1 1 i I I I M ! I I i I 1 1 M II 1 1 II I I I I I I I I ] 1 1 ! ! I I 1 M I I : I I 

MSDTGTAQKAPS PLYDWRPMSTYI RRPP YWEGALAGERTLRGMRP PAI LPDNI TTDH ISP 
370 380 390 400 410 420 

430 4.4 0 450 460 4 70 480 

SNAILAVSAAGEYLiAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

I Mill M I M I Mill II II I M IM I I Mill ! I I I I I I I M M I! I MMI ! I M I 

SNAILAGSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 
430 440 450 460 470 480 

490 500 510 520 530 540 

QGSFARVEPEGETMRMWEAIETYMNRKQPLI 1 1 AGADYGQGS SRDWAAKGVRLAGVEAI V 

II h I I I II I h I II M I I II I I I I I I I I I I M t I II I I I I I I I I I I I M I II I II II I : 
QGSLARVEPEGQTMRMWEAI ETYMNRKQPLI 1 1 AGADYGQGS SRDWAAKGVRLAGVEAI A 

490 500 510 520 530 540 

550 560 570 580 590 600 

AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

IIIIIIIIIIIIIIIIIMIIMI IMIIIIIIIIIIIIIIIMIIII IMIMIIM 

AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCGLTLVIHRKNG 
550 560 570 580 590 , 600 

610 620 630 640 

ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

IIIIIIIM M II Ml II M I II M II I MMM I II! 

ETVEVPVTCRPDTAEEALVYEAGGVLQRFAQDFLEGNAAX 
610 620 630 640 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 355>:
a099.seq

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGAAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCGACC 

451 GCCGATTTGG CCGGCAAAGG CTTGGCTAAA CCTTACGAAG AGCCTTCAGA

501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCCTGTA 

551 CCAATACTTC CAATCCGCGC AACGTTGTCG CCGCCGCGCT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GCAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCAGATCTGC

701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTTGCCTT CGCATGTACC 

751 ACCTGTAACG GCATGAGCGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTGTCAGGC AACCGCAACT 

851 TTGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGCATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA AGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTAC CGATGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAAT TTCGCGACGT TTATATCCCG ATGTTCGACA CCGGCACAGC

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGTCCAATG TCTACCTATA 

1151 TCCGCCGCCC ACCTTACTGG GAAGGCGCAC TGGCAGGGGA ACGCACATTA 

1201 AGCGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 



1251 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT
1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC
1351 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT
1401 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTTCGC
1451 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGCGCGGA
1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG
1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC
1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA
1701 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG

1751 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG TAAAAACGGC

1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT

1851 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT

1901 TGGAAGGGAA cgcGGCTTAG

This corresponds to the amino acid sequence <SEQ ID 356; ORF 099.a>:

a099.pep

1 MLGRASMMRL PDIVGVELNG KRKAGITATD IVLALTEFLR KERVVGAFVE
51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ
101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT
151 ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR
201 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFGIVAFACT
251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP
301 PLVVAYALAG SIRFDIENDV LGVADGKEIR LKDIWPTDEE IDAIVAEYVK
351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL
401 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP EEDFNSYATH
451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI
501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT
551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD VVGERTPRCD LTLVIHRKNG
601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA*



m099/a099 97.5% identity in 639 aa overlap 

mo 9 9 . pep mlgrasi^rLdivgvelngkrqagitatSivialtef^ervvgafJeffgegars ls 

'''''''' I I I I I I I I I I I I I I : I I I I I I I I I I l l i i i i i i i i i i i i i i i i i | | | || | |f 

a099 MLGRAS^MRLPDIVGVELNGKRfCAGITATDIVIALTEFLRKER^I^GA^E FFGEGARSLS 

10 20 3° 40 50 60 



m099.pep ^DRATISNMTPEF^^ 

anqp iJJJi IMIIIIIIIIIIIII 'MIIMinilll|||||MMimMimillll 
^099 IGDRATISNMTPEFGATAAMFAIDEQTIDYLKLTGRDDAQVKLVETYAKTAGLWADALKT 

70 80 90 100 no 120 

m099.pep ™YPRyKFDLSf^ 

a09 9 AVYPRVLKFDLS S VTRNMAGPSN PHARFATADLAGRGLAKPYEE PS DGQMPDGAVI IAAI 

130 140 150 160 170 180 

190 200 210 220 



m099 . pep ^^^^^^^^^^^^'^'^^ A ^^ AR ^'^^^^^^^^^^WVKS S F'A.FCSSKVAEI YLKEAGX.i.E'EMEKL 

= nOQ 1 1 1 " I I I I I I I I I II I I I I I I I | | | | : | | | | | | | | | | | , | | | | | | , , , 7 

a099 TSCTNTSNPRNWAAALl^ANRLGLQRKPWVKSSFAPGSKVAEIYLKEADLLPEME^ 



, an „ - 'WVKSSFAPGSKVAEIYLKEADLLPEME]' ' 

190 200 210 220 230 240 



25 0 260 270 280 ?Qn 

m099 . pep G fGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLlsP 

250 260 27 0 280 290 300 

m099.pep  PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAVVAEYVKPQQFRDVYVP
          ||||||||||||||||||||||||||||||||||||:||||||:||||||||||||||:|
a099      PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP

310 320 330 340 350 360 

370 380 390 400 410 420 

m09 9.pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 
I I I I I I I I 1 I I I I I f I I I I I I I t I I I I I I I I M It I I M I I I I I I I I I I I I I I I I I I I I 
a 0 9 9 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAI LPDNITTDHLS P 

370 380 390 400 410 420 

430 440 450 460 470 480 

m099 . pep S^3AILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I M I I I I I I I I I I I I I I I i I I I I M I I I I I I I II I I ( I I I I I I I I I I I I I : I I I I I M 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

mO 9 9 . pep QGS FARVEPEGETMRMWEAIETYMNRKQPLI I IAGADYGQGSSRDWAAKGVRLAGVEAIV 

I M : I I I I I I I : M II I I I I I E I I M I I I I I I II I I I I I I I I I I I I I I I I I I I M I i I I I 
a09 9 QGSLARVEPEGQTMRMWEAIETYMNRKQPLI I IAGADYGQGSSRDWAAKGVRLAGVEAIV 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERT PRCDLTLVIHRKNG 

M II i I II I 1 M I I I I II I I I It I I I II I I I I I I I I II I I M M I I I I I I I I I I I I M I 
a099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ET VE V P VT CC L DT AEE VLV YE AGG VLQR FAQ DFLEGNAAX 

I I I I I I : I I I I I I I I I I I I I I I I ! I I I I M I I I I I M I I 
a099 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357>: 

gl02 . seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

451 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 

This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

g102.pep

1 MSAKTPSLFG GAMIIAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF

51 SSGALMILEV NTHNPRGASF DTMVKDLLGR GWNIINGIAV ALVLYGSTYA

101 YILVGGDLTA KGIGSAVGGK ISLTVGQLVF FGILAFCVWA SARLVDRFTG

151 VLIGGMVLTF IWATGGLVAD AKPSVLFDTQ APVGTGYWIY AATALPVCLA

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWAGT LVALVIYVLW QTAIQSNLPR

251 NEFAPVIAAE RQLSVLNETL SKFAQTGDMD KILSLFPYMA IATSFLGVTL

301 GLFDNIADIF KWNDSMSGRG TKTVALNFLP PLISWLLLPT GFFTAIGASG

351 LAATVWDQGI IPAMLLYVSP QKIGAGKTYK VYGGLWLMLV FLFGIANIAA

401 QVLSQMELVP VFKG*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 359>:

m102.seq

1 ATGCCCAACA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 
51 CACGGTCATC GGCGCAGGCA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCTATG 

151 CTTTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCATT ATCCGCACGG 

201 CGCAAGTTTC GACACGATGG TCAAAGACCT" GCTCGGACGC GGCTGGAACA 

2 51 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCGAC GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATCC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTCACCGGC 

4 51 GTCCTTATCG GCGGCATGGT ATTGACCTTT ATTTGGGCGG CCGGCGGGCT 

501 GATTGCCGAT GCCAAGCCGT CCGTCCTCTT CGATACCCAA GCCCCCGCCG 

551 GCACAAACTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAAGGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTCAT CGCCGCCGAA GGGCAAGTCT CCGTCCTCAT 

801 CGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCATCTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCC CTGATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTCGTTACCG CCATCGGCTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TCTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCGTCAACAT CGCCGCACAG 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>:

m102.pep

1 MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM

51 LSSGLMILEV NTHYPHGASF DTMVKDLLGR GWNIINGIAV AFVLYLLTYA

101 YIFVGGDLTA KGLGSAAGGD VSLTVGQLVF FGILAFCVWA SARLVDRFTG

151 VLIGGMVLTF IWAAGGLIAD AKPSVLFDTQ APAGTNYWIY AATALPVCLA

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQGNLPR

251 NEFAPVIAAE GQVSVLIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL

301 GLFDYIADIF KWNDSISGRT KTAALTFLPP LISCLLFPTG FVTAIGYVGL

351 AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIVNIAAQ

401 VLSQMELVPV FKG*

ml02/gl02 86.0% identity in 415 aa overlap 



g!02 



10 20 30 40 50 60 

ml02 . pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I MIIIIIIMIIII MUM: MM: I : Ml I I I I | | | : I • U M I I I 
MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMILEV 
10 20 30 40 50 60 

70 80 90 100 HO 120 

NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 
1,1 1 : I H I M M I M U M I I M M U M : I I I M M I : M M M M I :\ \\ • M 
NTHNPRGASFDTMVKDLLGRGWNIINGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 
70 80 90 100 HO 120 



ml 02 .pep 
gl02 

ml02 .pep 
gl02 

ml 02 .pep 



130 140 150 160 170 180 

VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTO 
: ''''''''' I I I ' I ' I 1 H M I I I I I I [| | | | [ | | | | | | | I) : | | | : | | | | | j I | . I | | 

ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTO 
130 140 150 160 170 180 

190 200 210 220 230 

APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 



I I : I I : I I I I I I I I II M I I I I I I I I I I I I I I I I I 1 I I I I I I I I I t I : I I I : I I I I I I II 
gl02 APVGTGYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I : I I I I I I I I I I I I I I I : I I I 1111111111:11111111 I I I I I I I I I I I I! 
gl02 QTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQTGDMDKILSLFPYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ml02 .pep GLFDYIADIFKWNDSISGR-TKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWT-GI 
I I I I I I I I I I I I I I : I I I I I I : I I : I I I I I I I I I : I I I I I I I I I I I I I I I II 
g!02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 390 400 410 

ml02 . pep I PAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 

I I I I M I I : I : I I I I I I I I I I I I I I I : I I I I I I : I I I I I I I 11 I I I I i I I I I I 
gl02 IPAMLLYVSPQKIGAGKTYKVYGGLWLML-VFLFGIANIAAQVLSQMELVPVFKGX 
370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 361>: 
al02 . seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

201 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 

801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 
a102.pep

1 MPTKTPSLFG GAMIIAGTXI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM

51 LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL

301 GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL

351 AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIXNIAAX

401 VLSQMELVPV FKG*



m102/a102 95.9% identity in 413 aa overlap



10 20 30 40 50 60 

ml 02. pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
M: I I I M I I I 1 ( I I I II I I f | M I I M M I I I M I II I I | || | | | M I II I I IN I I I 
al02 MPTKTPSLFGGAMIIAGTXIGAGMIANPTATSGVWFTGSLAVLLYTWFSMLSSGLHILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml02 . pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 
11111111 '"I III 1111:11 ill! Ml MINIMUM! II II I I Ml II Mill- 
ar NTHYPHGAXFDTMVKDLLGRSWN I ING I AVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGN 

70 80 90 100 HO 120 

130 140 150 160 170 180 

VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTO 
1 1 ' I M M II M I M I I I I I II I I I II II : M M II M II II I : | | | M I M MINI 
VSLTVGQL VFFG I LAFCVWAS ARLVDRFT S VL I GGMVLT FI WATGGL I ADAKL PVL ^DTQ 
130 140 150 160 170 "i80 

190 200 210 220 230 240 

ml02 . pep APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
( I : 1 I 1 M I I : M M I II M II I I I I II M | I I II II I M M I I I I I I I I I M M II I II 
al02 APTGTNYWIYVATALPVCLAS FGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVI YVLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
11111 M I I II I I II I I II I II I! I II II I I I I I I I M II I | | i | | | M | M I M I I I 

QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
250 260 270 280 290 300 

310 320 33 0 340 350 360 

ml02.pep GLFDYIADIFKWNDSISGRTKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVW^GIIP 
' I I I I I I I I I I I II I : I I I I I M I I I | I | I MMMIMIMMIIIMMMMIMI 
al02 GLFDYIADIFKWNDSVSGRTKTAALTFLPPXISCLLFPTGFVT^ 

310 320 330 340 350 360 

370 3 80 390 400 410 

ml 02 . pep AMLLYRSRKKFGAGKTYICVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
I ' I I M I I I I M M I I I M I I II M I M I I I M I MM I I I M II M I I I I I 
al02 AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIXNIAAXVLSQMELVPVFKGX 

370 380 390 400 410 



al02 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 363>:

g105.seq

1 Atgtccgcag aaaCATACAc acAAAtcggc tGGgtaggct taggGcaaat 

51 gGgtctgcct atgGTAACGC GGCTCTTGGA CGGCGGCATC GAAGT CGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCcgc CAAAGGAGCA 

151 AAAGTTTACG GCagcACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

3 01 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 
351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 
451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 
551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 
601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 
651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 
701 CACTCAAACA CGCTTCCAAA GAcctTAACC TCGccgtcAA AGAGCTTGAA 
751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 
851 TGAAATTGGC AGAACACTGA 
This corresponds to the amino acid sequence <SEQ ID 364; ORF 105.ng>:

g105.pep

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH 
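
The numbered listings in this section give a position index followed by blocks of ten residues. Before any further analysis, such a listing can be collapsed into one contiguous string. The Python sketch below is illustrative only and is not part of the original disclosure: the function name `parse_listing` is hypothetical, and the two example rows are copied from the g105.pep listing above.

```python
def parse_listing(text: str) -> str:
    """Join a numbered sequence listing into one contiguous string."""
    blocks = []
    for line in text.splitlines():
        tokens = line.split()
        # Drop leading position indices such as "201"
        # (or "4 01" when the index was split during text extraction).
        while tokens and tokens[0].isdigit():
            tokens.pop(0)
        blocks.extend(tokens)
    return "".join(blocks)


# Last two rows of the g105.pep listing (SEQ ID 364).
rows = """201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE
251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH"""

sequence = parse_listing(rows)
print(len(sequence))   # 89 residues: 50 from the first row, 39 from the second
print(sequence[-9:])   # GVYLKLAEH
```

The same helper applies to the DNA listings, since they use the same index-plus-blocks layout.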

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 

ml05 . seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGaTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGG CGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGT C CGCGACTATC CCGTCATTTT 

2 01 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAm ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAaGC TCGCCGTCAA AGCACTTGTC GAAGCGCAGm GaCAGTTTGC 

351 CGAAGCACCC GTTTCCGGAT CGGTCGGGCC CGCCACCAAC GGCACGCTGC 

401 TGATTCTGTT CGGCGGCAGC GAAcCGtTTT AAACCCGCTG CAAAAAATAT 

4 51 TTTCCCTCGT CGGCAAAAAA ACCTTCCATT TCGGCGATGT CGGCAAAGGT 

501 TCGGGCGCGA AACTCGTCTT GAACTCGCTC TTGGGCATTT TCGGCGAaCG 

551 TAcAGCGAAs GmTg CTGATG GCGCGGCAGT TCGGCATCGA TACCGACACC 

601 AT CGTCGAAG CCATCGGsGA CTCGGCAATG GACTCGCCCA TGTTCCAAAC 

651 CAAAAAATCC CTGTGGGCAA ACCGCGAATT CCCGmCCGmC TTCGCCCTCA 

701 AACACGCCTC CAAAGACCTC AACCTCGCCG TCAAAGAG CT TGAACAGGCA 

751 GGCAACACCC TGCCCGCCGT CGAAACCGTT GCTGCC AG CT ACCGCAAAGC 

8 01 AGTCGAAGCC GGCTACGGGA CACAGGACGT TTCCGGCGTT TACCTGAAAC 

851 TGGCAGAACA CTGA 

This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml 05 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TE KLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

201 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

251 GNTLPAVETV AASYRKAVEA GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105.ng)
from N. gonorrhoeae:

ml05/gl05 

10 20 30 40 50 60 

gl 0 5 . pep MS AETYTQ I GWVGLGQMGL PMVTRLLDGG I EVGVYNRS PDKT AP I S AKGAKVYGS TAELV 

Ilh Ml I MM 1 1 I II 1 1 MM Ml M Ml I I M I II III I M I M II I M ^M I M 

ml 0 5 MSANEYAQ I GW I GLGQMGL PMVTRLLDGG I EVGVYNRS PDKTAP I S AKGAKV YGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl05 . pep RACPVI FLMVSDYAAVCDILNGVRDGLAGKI IVNMSTI SPTENLAVKALVEAAGGQFAEA 

I I II 1 1 1 1 1 I M 1 1 1 1 1 M M I M M I 1 1 II II 1 1 1 MM I I M II I Mill 

ml 0 5 RDYPVI FLMVSD Y AAVCD I LNGVRDGLAGX 1 1 VNMS T I S PTE KLAVKALVEAQR - QFAEA 

70 80 90 100 110 

130 140 150 160 170 180 

glO 5 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

I II M !M M II I I I III I Mi : : | | Ml :: - M 
ml 0 5 PVSGS VGPATNGTLLI LFGGSEPFXTRCKKYFPSSAKKP -SI SAMSAKVRARNSSXTRSW 

120 130 140 150 160 170 
                   190       200       210       220       230       240
g105.pep    IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK
             |        |||||||||||||||||| |||||||||||||||||||||  ||||||||
m105        AFSANVQRXXLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXXFALKHASK
           180       190       200       210       220       230

                   250       260       270       280      289
g105.pep    DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEH
            ||||||||||||||||||||||||||||||||||| |||||||||||||
m105        DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGTQDVSGVYLKLAEH
           240       250       260       270       280



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367>:

a105.seq

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT
51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG
101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA
151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT
201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC
251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG
301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT
351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC
401 tgctgattct gttcggcggc agcgaagccg ttttaaaccc gctgcaaaaa
451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA
501 AGGTTCGGGC gcgaaactcg TCTTGAACTC GCTCTTGGGC ATTTTCGGCG
551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC
601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT
651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG
701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA
751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG
801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC
851 TGAAATTGGC AGAACACTGA

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>:

a105.pep

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA

51 KVYGNTAELV RDYPVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH*

ml05/al05 96.5% identity in 289 aa overlap 
na05.pep ^ANEYAQIGWIGLGQMGLP^ 

1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m , | ,77, , 

al05 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPIS^ 

10 20 30 40 50 60 



ml05 . pep RDYPVIFIjWSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAG-OFAEA 

al05 RDYPVIFLWSDYAAVCDILNGVRDGIAGKIIVNMSTISPTENLAVra 

70 80 90 100 no 120 



m l05.pep 12 ^S^^ 
P^Gi ^ 

140 150 160 170 leo 

180 190 200 210 220 230 



ml05 . pep I FGDV-QRXMLMARQFG I DTDT I VEAI GDS AMDS PMFQTKKSLWANREFPXAFALKHASK 

til:: : : I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
al05 IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

240 250 260 270 280 

ml 05 . pep DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEHX 

I I I I I I I I I I I I I I M I I I I M M I I I I I I I I II I I I I I I I I I I I I I M I 
a 1 0 5 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 369>:

g105-1.seq

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 370; ORF 105-l.ng>: 

gl05-l .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 
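
The correspondence between each DNA listing and its amino acid listing can be checked by translating the DNA in frame from the first base. The sketch below is an illustration under stated assumptions and not the method used to produce the listings: it assumes the standard genetic code, writes stop codons as `*`, and writes any codon containing an ambiguous base as `X` (a simplification, since a codon such as GGN is still unambiguous). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace and letter case are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are not in the table.
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First thirty bases of g105-1.seq (SEQ ID 369).
print(translate("ATGTCCGCAG AAACATACAC ACAAATCGGC"))  # MSAETYTQIG
```

The output matches the first ten residues of the g105-1.pep listing above.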

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 371>:

m105-1.seq

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAACTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

m105-1.pep

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA

51 KVYGNTAELV RDYPVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT

201 DTIVEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH*



m105-1/g105-1 96.9% identity in 289 aa overlap

                    10        20        30        40        50        60
m105-1.pep  MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV
            |||  | |||| |||||||||||||||||||||||||||||||||||||||||| |||||
g105-1      MSAETYTQIGWVGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGSTAELV
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m105-1.pep  RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA
            |  |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g105-1      RACPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m105-1.pep  PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g105-1      PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m105-1.pep  IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK
            |||||||| ||||||||||||||||||| |||||||||||||||||||||||||||||||
g105-1      IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK
                   190       200       210       220       230       240

                   250       260       270       280       290
m105-1.pep  DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX
            ||||||||||||||||||||||||||||||||||||||||||||||||||
g105-1      DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX
                   250       260       270       280       290
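
The "% identity in N aa overlap" figures quoted with each alignment can be read as the fraction of aligned columns in which both sequences carry the same residue. The sketch below illustrates that reading only; the exact counting convention of the alignment program is not stated in this section, and the function name `percent_identity` is hypothetical. The two strings are the first sixty aligned residues of m105-1.pep and g105-1 as listed above.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned columns holding identical residues.

    Both strings must already be aligned (equal length, '-' for gaps).
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


m105_1 = "MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV"
g105_1 = "MSAETYTQIGWVGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGSTAELV"

# 55 of these 60 columns are identical.
print(round(percent_identity(m105_1, g105_1), 1))  # 91.7
```

Applied to the full listings, the same count gives 280 identical positions out of 289 for m105-1 against g105-1, which agrees with the 96.9% stated above when the ambiguous residue X is treated as a mismatch.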



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>:

a105-1.seq

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-1.a>:

a105-1.pep

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA

51 KVYGNTAELV RDYPVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH*



a105-1/m105-1 99.0% identity in 289 aa overlap

                    10        20        30        40        50        60
a105-1.pep  MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV
            |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
m105-1      MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV
                    10        20        30        40        50        60

                    70        80        90       100       110       120
a105-1.pep  RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m105-1      RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA
                    70        80        90       100       110       120

                   130       140       150       160       170       180
a105-1.pep  PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m105-1      PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG
                   130       140       150       160       170       180

                   190       200       210       220       230       240
a105-1.pep  IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK
            |||||||| ||||||||||||||||||| |||||||||||||||||||||||||||||||
m105-1      IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK
                   190       200       210       220       230       240

                   250       260       270       280       290
a105-1.pep  DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX
            ||||||||||||||||||||||||||||||||||||||||||||||||||
m105-1      DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX
                   250       260       270       280       290



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 375>:

g107.seq

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGGCGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

301 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

451 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 

This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 

g107.pep

1 MVLTFIWATG GLVADAKPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH

51 GNVSSLLKYF KGDAPKVAKS IWAGTLVALV IYVLWQTAIQ SNLPRNEFAP

101 VIAAERQLSV LNETLSKFAQ TGDMDKILSL FPYMAIATSF LGVTLGLFDN

151 IAGHLQMERQ YVRAAPKPSR *

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 377>:

m107.seq

1 ATGGTATTGA CCTTTATTTG GGCGGCCGGC GGGCTGATTG CCGATGCCAA 

51 GCCGTCCGTC CTCTTCGATA CCCAAGCCCC CGCCGGCACA AACTACTGGA 

101 TTTACGCCGs CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTCATCGCCG CCGAAGGGCA AGTCTCCGTC CTCATCGAAA CCCTGTCCAA 

3 51 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

451 ATCGCCCATC TTCAAATGGA ACGACAGCAT CTCCGGgCCG CACCAAAACC 

501 GCCGCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

m107.pep

1 MVLTFIWAAG GLIADAKPSV LFDTQAPAGT NYWIYAXTAL PVCLASFGFH

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY

151 IAHLQMERQH LRAAPKPPR*

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng)
from N. gonorrhoeae:

m107/g107

                    10        20        30        40        50        60
m107.pep    MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLASFGFHGNVSSLLKYF
            |||||||| ||| |||||||||||||| || ||||| |||||||||||||||||||||||
g107        MVLTFIWATGGLVADAKPSVLFDTQAPVGTGYWIYAATALPVCLASFGFHGNVSSLLKYF
                    10        20        30        40        50        60

m107.pep    KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ
            |||||||||||| ||| ||||||||||||| |||||||||||||| | ||| ||||||||
g107        KGDAPKVAKSIWAGTLVALVIYVLWQTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQ
                    70        80        90       100       110       120

                   130       140       150       160       170
m107.pep    TGNMDKILSLFSYMAIATSFLGVTLGLFDYIA-HLQMERQHLRAAPKPPR
            || |||||||| ||||||||||||||||| || |||||||  |||||| |
g107        TGDMDKILSLFPYMAIATSFLGVTLGLFDNIAGHLQMERQYVRAAPKPSR



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379>:

a107.seq



1 ATGGTATTAA

51 ACTGCCCGTC

101 TTTATGTCGC

151 GGCAACGTCT

201 GGCTAAATCC

251 TCTGGCAAAC

301 GTGATTGCCG

351 ATTCGCCCAA

401 TGGCGATCGC

451 ATCGCCGACA

501 CGCCGCGCTG

551 CCGGCTTTGT

601 ACAGGCATCA

651 CGCAGGCAAA

701 TCTTCCTTTT

751 GAACTCGTCC



This corresponds to the amino acid sequence <SEQ ID 380; ORF 107.a>:
a107.pep

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGIIPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 

m107/a107 94.8% identity in 154 aa overlap

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 381>:

g108.seq

1 ATGttgccgg gCTTCAACCG GATATTCAa a cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

301 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA
351 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC

401 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtCCacG
451 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 
501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 

g108.pep

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE

51 MNKTLSILPA AILLGGCAAG GNTFGSLDGG TGMGGSIVKM TVESQCRAEL

101 DRRSEWRLTA LAMSAEKQAE WENKICGCAT EEAPNQLTGN DVMQMLNQST

151 RNQALAALTV KTVSACFKRL YR* 
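
The correspondence between a listed DNA sequence and its amino acid sequence can be checked by translating the reading frame codon by codon. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the function name `translate` and the table constants are illustrative and are not part of the disclosure. Codons that contain anything other than A, C, G or T are reported as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace and letter case are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Unresolvable codons (ambiguity codes, stray characters) become X.
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


if __name__ == "__main__":
    # First row of the ORF 108.ng DNA listing.
    first_row = "ATGttgccgg gCTTCAACCG GATATTCAaa cggTTTGCTC CAACACTCGG"
    print(translate(first_row))
```

Applied to the first row of the ORF 108.ng DNA listing, the sketch yields the first sixteen residues of the corresponding peptide listing (MLPGFNRIFK RFAPTL).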

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 383>: 

m108.seq

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA

301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG

501 CTTCAAACAC CTGTACCGCT AA 
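
The listings in this section number each row by the position of its first residue and print the residues in blocks of ten. A minimal sketch of how such a listing could be joined into one contiguous sequence and its row numbering checked is given below. The function name `parse_listing` is hypothetical, and the sketch assumes every non-empty line has the form "position, whitespace, residue blocks"; rows carrying leading dots (partial sequences) are not handled.

```python
import re


def parse_listing(text: str) -> str:
    """Join numbered sequence rows into one string, verifying each row index."""
    parts = []
    total = 0
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank separator lines between rows
        match = re.match(r"^(\d+)\s+(\S.*)$", line)
        if match is None:
            raise ValueError(f"unrecognised row: {line!r}")
        start = int(match.group(1))
        residues = re.sub(r"\s+", "", match.group(2))
        # Each row must begin one past the number of residues read so far.
        if start != total + 1:
            raise ValueError(f"row labelled {start}, expected {total + 1}")
        parts.append(residues)
        total += len(residues)
    return "".join(parts)


if __name__ == "__main__":
    rows = """
    1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG

    51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA
    """
    print(len(parse_listing(rows)))
```

A row whose printed position disagrees with the running residue count raises `ValueError`, which is one way to detect rows that were split, merged or dropped during transcription.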

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

m108.pep

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE

51 MNKTLSILPV AILLGGCAAG GGNTFGSLDG GTGMGGSIVK MAVGSQCRAE

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORF (ORF 108.ng) 
from N. gonorrhoeae: 

m108/g108

m108.pep    MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV
            |||||||||||| |||||||||||||||||||||| |||||||||||||||||||||||
g108        MLPGFNRIFKRFAPTLGTAHKTPPFALSRTGRLIRSYRHKRRGFNRKGIEMNKTLSILPA

m108.pep    AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA
            ||||||||||| ||||||||||||||||||| | |||||||| |||||||||||||||||
g108        AILLGGCAAGG-NTFGSLDGGTGMGGSIVKMTVESQCRAELDRRSEWRLTALAMSAEKQA
                    70         80        90       100       110

m108.pep    EWENKICACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX
            ||||||| |   |||   |||||||||  ||||||||||| |||||||| ||||
g108        EWENKICGCATEEAPNQLTGNDVMQMLNQSTRNQALAALTVKTVSACFKRLYRX
          120       130       140       150       160       170
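
The identity figures quoted for each comparison (for example, 89.6% identity over a 173 aa overlap for ORF 108 and ORF 108.ng) can be reproduced from a pair of aligned sequences. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, and it assumes that the overlap length counts every aligned column, including columns where one sequence has a gap, which is consistent with 155 identical positions over 173 columns giving 89.6%.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Columns that are gaps in both sequences carry no information and are skipped.
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Short segment of the ORF 108 / ORF 108.ng comparison.
    print(percent_identity("MAVGSQCRAELDKRS", "MTVESQCRAELDRRS"))
```

A gap column lowers the identity because it enters the denominator but can never count as a match.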

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>:
a108.seq

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGG GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 

301 TTGAACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAATGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCAACCA GCTGACCGGC AACGATGTGA TGCAGATGCT GGATCCGTCC

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 386; ORF 108.a>: 
a108.pep

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE
51 MNKTLSILPV AILLGGCAAG GGNTFGSLDG GTGMGGSIVK MAVESQCRAE
101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 
151 TRNQALAALT AKTVSACFKH LYR* 

m108/a108 96.5% identity in 173 aa overlap

                    10        20        30        40        50        60
m108.pep    MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a108        MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV

                    70        80        90       100       110       120
m108.pep    AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA
            ||||||||||||||||||||||||||||||||| ||||||| ||||||||||||||||||
a108        AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVESQCRAELNKRSEWRLTALAMSAEKQA

                   130       140       150       160       170
m108.pep    EWENKICACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX
            |||||||||||||||   ||||||||| ||||||||||||||||||||||||||
a108        EWENKICACVAQEAPNQLTGNDVMQMLDPSTRNQALAALTAKTVSACFKHLYRX
                   130       140       150       160       170


The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 387>:

g109.seq

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

251 TTGTCGGACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG

301 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA
351 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

g109.pep

1 MYYRRVVGLS DGLGDLAAGI DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVGRVLI PAVGFLILCV

101 AMGAVGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

m109.seq

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

251 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G

This corresponds to the amino acid sequence <SEQ ID 390; ORF 109>:

m109.pep

1 MYYRRVMGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG RVLIPAVGFL

101 ILCVAMGAVG MLPGIPPFLE HFKSLG*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N. gonorrhoeae: 

m109/g109

m109.pep    HFKSLGX
             ||||||
g109        QFKSLGX
             120



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391>:
a109.seq

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 
51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 
101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 
151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT
201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG

251 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109.a>:
a109.pep

1 MYYRRVVGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR
51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG RVLIPVVGFL
101 ILCVAMGAVG MLPGIPPFLE HFKSLG*

m109/a109 97.6% identity in 126 aa overlap

m109.pep    MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN

m109.pep    HFKSLGX
            |||||||
a109        HFKSLGX



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>:

g111.seq

1 atgccgtctg aaacacgcct gccgaacctt atccgcgcct tgatatttgc 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtccaCC TACCAGACCG 

251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 

301 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 

351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 111.ng>:
g111.pep

1 MPSETRLPNL IRALIFALGF IFLNACSEQT AQTVTLQGET MGTTYTVKYL

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 ASITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 395>:

m111.seq

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

201 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 



251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA

651 GCACGGCAAA GGCAAAAACG

701 AGCAGCCCAA TATCGTCCAA

751 AACAACCGTT CGCTTGCCAC

801 TAAAAACGGC AAACGCCTCT

851 CCATCAGCCA CAACCTCGCC

901 ACGGCGGACG GCTTGTCCAC

951 CTTAAAGCTG GCAGAGCGCG

1001 ATAAAGGCGG CTACCGCACC

1051 CGCTAA





This corresponds to the amino acid sequence <SEQ ID 396; ORF 111>:

m111.pep

1 MPSETRLPNF IRVLIFALGF IFLNACSEQT AQTVTLQGET MGTTYXVKYL
51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEISRF NQHTAGKPLR 
101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 
151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 
201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQTIVPL 
251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL
351 R* 
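
Several rows of the DNA listing for ORF 111 contain IUPAC ambiguity codes (y, w, k), and the peptide listing shows X at the corresponding residues. One way such positions could be handled is sketched below: each ambiguous codon is expanded into every concrete codon it may represent, and a residue is reported only if all expansions agree. This is an assumption about how the X residues arise, not a statement of the method actually used; the function name `translate_ambiguous` and the table constants are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC nucleotide codes and the bases each one may stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate_ambiguous(dna: str) -> str:
    """Translate frame 1, writing X where an ambiguous codon is not resolved."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if any(base not in IUPAC for base in codon):
            protein.append("X")  # character outside the IUPAC alphabet
            continue
        # Every concrete codon that this possibly ambiguous codon can represent.
        residues = {
            CODON_TABLE["".join(choice)]
            for choice in product(*(IUPAC[base] for base in codon))
        }
        protein.append(residues.pop() if len(residues) == 1 else "X")
    return "".join(protein)


if __name__ == "__main__":
    # Codons taken from row 101 of the ORF 111 DNA listing (ACC TAT AyC GTC).
    print(translate_ambiguous("ACCTATAyCGTC"))
```

With this rule the codon AyC (ACC or ATC, Thr or Ile) becomes X, whereas a codon such as TAy (TAC or TAT, both Tyr) is still resolved.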

ORF 111 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 111.ng)
from N. gonorrhoeae:

m111.pep/g111.pep

                    10        20        30        40        50        60
m111.pep    MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP

m111.pep    GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK

g111        SALX

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 397>: 
a111.seq

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA
451 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA
601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT
651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG
701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG
751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA
801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC

1051 CGCTAA

This corresponds to the amino acid sequence <SEQ ID 398; ORF 111.a>:
a111.pep

1 MPSETRLPNF IRTLIFALSF IFLNACSEQT AQTVTLQGET MGTTYTVKYL

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISVVADSAM

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL

351 R*



m111/a111 97.7% identity in 351 aa overlap



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 399>:

g111-1.seq

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC
51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG
101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT
151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT
201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG
251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC
301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG
351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT
401 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA
451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA
601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT
651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG
701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg
751 aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA
801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC
851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA CAGTGCAATG
901 ACGGCGGACG GTTTATCCAC AGGATTATTT GTTTTAGGCG AAACCGAAGC
951 CTTAAGGCTG GCAGAACAAG AAAAACTCGC TGTTTTCCTA ATTGTCCGGG
1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC
1051 CGCTAA

This corresponds to the amino acid sequence <SEQ ID 400; ORF 111-1.ng>:

g111-1.pep

1 MPSETRLPNL IRALIFALGF IFLNACSEQT AQTVTLQGET MGTTYTVKYL
51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR
101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ
151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE
201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL
251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVSDSAM
301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL
351 R*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 401>:

m111-1.seq

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC
51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG
101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT
151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT
201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG
251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC
301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG
351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT
401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA
451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA
601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT
651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG
701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG
751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA
801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC
851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG
901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC
951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG
1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC
1051 CGCTAA

This corresponds to the amino acid sequence <SEQ ID 402; ORF 111-1>:

m111-1.pep

1 MPSETRLPNF IRVLIFALGF IFLNACSEQT AQTVTLQGET MGTTYTVKYL
51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR
101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ
151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE
201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL
251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM
301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL
351 R*

m111-1/g111-1 96.6% identity in 351 aa overlap

                    10        20        30        40        50        60
m111-1.pep  MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP
            ||||||||| || |||||||||||||||||||||||||||||||||||||||||||||||
g111-1      MPSETRLPNLIRALIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m111-1.pep  AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH
            | |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
g111-1      AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m111-1.pep  GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK
            ||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||
g111-1      GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILQQGKDYASLSKTHPK
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m111-1.pep  AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ
            |||||||||||||||||||||||||||||||||||||||||||| ||||||||||||| |
g111-1      AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m111-1.pep  GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
g111-1      GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVSDSAM
                   250       260       270       280       290       300

                   310       320       330       340       350
m111-1.pep  TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX
            |||||||||||||||||| ||| |||||||||||| |||||||||| |||||
g111-1      TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX
                   310       320       330       340       350

g111-1/p44550

sp|P44550|YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi|1074292|pir||C64144
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi|1573128 (U32702)
lipoprotein, putative [Haemophilus influenzae Rd] Length = 346
Score = 349 bits (885), Expect = 2e-95

Identities = 177/328 (53%), Positives = 240/328 (72%), Gaps = 4/328 (1%)

Query: 23 LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXMSTYQ 82 

L AC ++T + ++L G+TMGTTY VKYL + S + + I+ LK+ N MSTY+

Sbjct: 17 LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEIEAILKDVNAKMSTYK 74

Query: 83 PDSEISRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 141 

DSE+SRFNQ+T P+ IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 
Sbjct: 75 KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPVVNLWGFGPEKR 134

Query: 142 VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 201 

++P+PEQ+ + ++ GIDKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 
Sbjct: 135 PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 194 

Query: 202 EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 261 

E+ QNY+VEIGGE+ KGKN G+PW+I IE+P + ++ LNN +A+SGDY 

Sbjct: 195 EQLNAQNYMVEIGGEIRAKGKNIEGKPWQIAIEKPTTTGERAVEAVIGLNNMGMASSGDY 254 

Query: 262 RIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAMTADGLSTGLFVLGETEALKLA 321

RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A
Sbjct: 255 RIY-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 313

Query: 322 EREKLAVFLIVRDKGGYRTAMSSEFEKL 349

E+ LAV+LI+R G+ T SS F+KL 
Sbjct: 314 EKNNLAVYLIIRTDNGFVTKSSSAFKKL 341 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

a111-1.seq

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA

451 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA

801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC

1051 CGCTAA



This corresponds to the amino acid sequence <SEQ ID 404; ORF 111-1.a>:

a111-1.pep

1 MPSETRLPNF IRTLIFALSF IFLNACSEQT AQTVTLQGET MGTTYTVKYL

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISVVADSAM

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



a111-1/m111-1 98.9% identity in 351 aa overlap

                    10        20        30        40        50        60
a111-1.pep  MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP
            |||||||||||| ||||| |||||||||||||||||||||||||||||||||||||||||
m111-1      MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP
                    10        20        30        40        50        60

                    70        80        90       100       110       120
a111-1.pep  AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH
            ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||
m111-1      AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH
                    70        80        90       100       110       120

                   130       140       150       160       170       180
a111-1.pep  GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m111-1      GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK
                   130       140       150       160       170       180

                   190       200       210       220       230       240
a111-1.pep  AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m111-1      AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ
                   190       200       210       220       230       240

                   250       260       270       280       290       300
a111-1.pep  GGNTQIIVPLNNRSLATSGDYRIFHVDKSGKRLSHIINPNNKRPISHNLASISVVADSAM
            |||||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
m111-1      GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM
                   250       260       270       280       290       300

                   310       320       330       340       350
a111-1.pep  TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX
            ||||||||||||||||||||||||||||||||||||||||||||||||||||
m111-1      TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX
                   310       320       330       340       350



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 405>:
g114.seq

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG
101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

251 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC

351 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA

401 GTTTGACGAT TTCGCGGATG TAA

This corresponds to the amino acid sequence <SEQ ID 406; ORF 114.ng>:
g114.pep

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR SMSVTVGLFC VSINLTISVE
51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 
101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>:

m114.seq

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TATCAGAGCC GCCGCATGTA AAACAGAGTG 

201 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 

251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 

351 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA

This corresponds to the amino acid sequence <SEQ ID 408; ORF 114>:

m114.pep

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR SMSVTVGLFC VSINLTISVE

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 

m114/g114 90.0% identity over a 140 aa overlap

m114.pep    MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSGYFIRA
            |||||||||||  |||||||||||||| |||||||||||||||||||||||| |||| ||
g114        MASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGQSGYFTRA
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m114.pep    AACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL
            | ||| |||| |||||| | | |||||||| ||||| |||||||||||||| ||||||||
g114        AECKTGCQGISPSCLNERTVCEVTIKWSSSETSTSDMACASRLVNMMSSCEGSGEPPGWL
                    70        80        90       100       110       120

                   130       140
m114.pep    CAIIRLSAYSSNASLTISRMX
            |||||||||||||||||||||
g114        CAIIRLSAYSSNASLTISRMX
                   130       140



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 409>:
a114.seq



1 atgccggagg caagcatcgc ctccatcact tcgccgctgc acggggcgca 

51 acaggaatgc agcaagactt ttttatgtcc gccgggcggg acgagtatgg 

101 ggcggtcaat gtcggtaacg gtaggtttgt tttgtgtttc cattaactta 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCCrrrr 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTgSg ScgScGGA 

251 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC atcgaccaS 

?m ^JJ GCCT GTGCCAGCCG ccttgtgaac atgatgtctt cctgcSJgg 

351 ttcgggcgag ccgcccggat ggttgtgcgc gataatcagg ctgtcggcat 

401 attcgtccaa tgccagtttg acaatttcac ggatgtaa

This corresponds to the amino acid sequence <SEQ ID 410; ORF 114.a>:

a114.pep

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSMSVT VGLFCVSINL

51 TISVEYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS

101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 



m114/a114 92.9% identity in 140 aa overlap

                         10        20        30        40        50
m114.pep         MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG
                  ||||||||||  |||||||||||||| |||||||||||||||||||||||||||
a114        MPEASIASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG
                    10        20        30        40        50        60

               60        70        80        90       100       110
m114.pep    YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE
            |||||||||| |||| |||||| | | ||||||||||||||||||||||||||||| |||
a114        YFIRAAACKTGCQGISPSCLNERTVCAVTIKWSSSDTSTSDIACASRLVNMMSSCEGSGE
                    70        80        90       100       110       120

              120       130       140
m114.pep    PPGWLCAIIRLSAYSSNASLTISRMX
            ||||||||||||||||||||||||||
a114        PPGWLCAIIRLSAYSSNASLTISRMX
                   130       140



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 411>:

g117.seq

1 atggtcgacg aactcgacCT GCTGCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGATTG GAACCTATTG GTTTCCGAGC 

101 GCTGCAACAG CACCGTCGCC GAGCTGGTCA AAGGTGtgga CGAAGTGCAG 

151 AAACTTACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCACAGCAA GCGGAAACCA TGCGGAAAAT GCTGCTGGCg atggttaccg 

251 Acatccgcgt cgtaTTAATC AAACTGGCGA TGCGTacgcg caccCTGcta 

301 ttTTtaaGCA ACGCCCCCGA CAGCCCTGAA AAACgcgccG TCgccaaAga

351 aacccTCGAC ATCTTCGCCC CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC

401 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA
451 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA
501 ATACATCGAA AACTTCCTCG ATATCCTGCG TACGGAACTC AAAAAATACA
551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC
601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGgccTGT TCGACATCCG
651 CGCCGTGCGG ATTCTGGTCG ATACCGTCCC CGaGTGTTAC ACCACGCTGG 
701 gcaTCGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGagtt CGAcgactAC 
751 ATCGCCAACC CCAAAGgcaA CGgttATAAA AGtTTGCACA CCGTCATCGT
801 cggcccGGAa gacaaaggtg tggaaGtgCA AATCCGCACC TTCGAtatGC 
851 accAATTCaa CgaatTcggT gtcgccgCCC ACTGGCGtta caaagaaggc 
901 ggcaaaggcg attccGCCtA cgaacaaAAA ATcgccTggt TGCgccaACT 
951 CTTGGACTGG CGCGAAAATA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCTCT GCCAACGGGC GCAACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TcggCGACCG CTGCCGGGGC GCGAAAGTCG 

1151 AaggGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGCGTC 

1201 GAAATcatta cCGCcaaAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGctgGGtc aAATCCGGCA AGGCCATCGG caaAATCCGC GCCTAcatCC 

1301 GCCAGcaaAa cgCcgaCACC GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 

1351 AAGCAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTgccga 

1401 aaATCTCGGC tacaaAAAGC cagaagacct ctacacCGCc gtcggacaag

1451 gcgaaatttc caaccgcgcc atCcaaaaag cctgcggcac GCTgaacgaa 

1501 ccgccCCCCG TGCCCGTCAG CGCAACCACC ATCGTCAAAC AGTCCAAAAT
1551 CAAAAAAGGT GGCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA
1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 
1651 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 
1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 
1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC
1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC
1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG

1951 CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 117.ng>:

g117.pep

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRKMLLA MVTDIRVVLI KLAMRTRTLL

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK

151 YREIALLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHIYSIY

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY
251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV
401 EIITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD

451 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE

501 PPPVPVSATT IVKQSKIKKG GKTGVLIDGE DGLMTTLAKC CKPAPPDDIA
551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI
601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP
651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 413>:

m117.seq (partial)

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 
51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 
101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 
151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 
201 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 
251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 
351 CTGgCGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

401 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 
451 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 
501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 
551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 
601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 
651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGGCATCCTT 
701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 
751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 
801 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 
851 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCG AGAAGACCTC 
901 TACACCGCCG TCGGACAAGG CGAAATTTCC AACCGCGCCA TCCAAAAAGC 
951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 
1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 
1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 
1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

1201 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 
1251 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

1301 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 
1351 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 
1401 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
1451 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 117>: 

m117.pep (partial) 
1 . . . . VKLKKYNVHF EVAGRPKHIY SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 

51 ECYTTLGIVH SLWQPIPGEF DDYIANPKGN GYKSLHTVIV GPEDKGVEVQ 

101 IRTFDMHQFN EFGVAAHWRY KEGGKGDSAY EQKIAWLRQL LDWRENMAES 

151 GKEDLAAAFK TELFNDTIYV LTPHGKVLSL PTGATPIDFA YALHSSIGDR 

201 CRGAKVEGQI VPLSTPLENG QRVEIITAKE GHPSVNWLYE GWVKSNKAIG 

251 KIRAYIRQQN ADTVREEGRV QLDKQLAKLT PKPNLQELAE NLGYKKPEDL 

301 YTAVGQGEIS NRAIQKACGT LNEPPPVPVS ETTIVKQSKI KKGGKNGVLI 

351 DGEDGLMTTL AKCCKPAPPD DIIGFVTRER GISVHRKXXX SFQHLAEHAP 
401 XKVLDASWAA LQEGQVFAVD IEIRAQDRSG LLRDVSDALA RHKLNVTAVQ 

451 TQSRDLEASM RFTLEVKQVN DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 117 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 117.ng) 
from N. gonorrhoeae: 

m117/g117 

10 20 30 

ml 17 .pep VKLKKYNVHFEVAGRPKHI YS I YKKMVKKKL 

:MM|:|IIIIIII!MIIIIIIII1I|| 
gll7 EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHIYSI YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

ml!7 .pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I ! 1 1 1 1 1 1 1 1 

gl 1 7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPI PGEFDDYIANPKGNGYKSLHTVIVG 

210 220 230 240 250 260 

100 110 120 130 140 150 

tnll7 . pep P EDKGVE VQ I RT FDMHQ FNE FGVAAHWRYKEGGKGD S AYE QKI AWLRQ LLDWRENMAE S G 

IMMMi 111! II IMMMIIM MMMI MMIMMM Ml M M II! IMM M 

gl 1 7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKI AWLRQ LLDWRENMAESG 

270 280 290 300 310 320 

160 170 180 190 200 210" 

ml 17 .pep KEDLAAAFKTELFNDTIWLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

llllll II MM MMMM MUM IMMMM II MM II Ml M MM IMMM I 

gll7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 

220 230 240 250 260 270 

m!17 . pep PLSTPLENGQRVE 1 1 TAKEGHPSVNWLYEGWVKSNKAIGKI RAYI RQQNADTVREEGRVQ 

Mill I III IMIIIIMMMIM IMIMI M:MIII! II III Ml Ml MMIIM 

g 1 1 7 PLSTPLENGQRVE 1 1 TAKEGHPSVNWL YEGWVKS GKAI GKI RAYI RQQNADTVREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

ml 1 7 . pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 

I I Mill! IIMIIIIIIMIIMI IIMMMI Ml MM II II I II I MMMIII I 

gl 1 7 IiDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSA 
450 460 470 480 490 500 

340 350 360 370 380 390 

mil 7 . pep TTI VKQS KI KKGGKNGVL I DGEDGLMTTLAKCC KPAP PDD I IGFVTRERGISVHRKXXXS 

II I M I 1 M I M 1 1 M 1 1 M 1 1 I II I M 1 1 MM 1 1 1 1 M I I M I I MM I M Mi ^ 

g 1 1 7 TTI VKQS KI KKGGKTGVL I DGEDGLMTTLAKCC KPAP PDD I AGFVTRERG I SVHRKT CPS 

510 520 530 540 550 560 

400 410 420 430 440 450 

mll7 .pep FQHLAEHAPXKVLDASWAALQEGQVFAVDI E I RAQDRSGLLRDVSDALARHKLNVTAVQT 
MINIM IMIIIIIMIIIIIMlllllllllllMMMIIIIIIMMMIIII 

9117 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEII^QDRSGLLRDVSDALARHKIjNVTAV^ 
570 580 590 600 610 620 

460 470 480 490 

ml 1 7 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 

HMMIIMHIIMIIIMMIIhlMIIIMIIIIIl 

9117 OSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 
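
As a rough illustration of how an identity figure of this kind can be computed, the following minimal sketch counts matching columns between two aligned rows. The function name `percent_identity` is hypothetical, and the counting convention (every aligned column in the denominator, gap columns scored as mismatches) is an assumption; the program that produced the figures quoted in this document may count differently. The two strings are the final 40 residues of m117.pep and of ORF 117.ng as listed above.

```python
def percent_identity(row_a: str, row_b: str) -> float:
    """Percent of aligned columns in which both rows carry the same residue."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    if not row_a:
        raise ValueError("aligned rows must not be empty")
    # A column counts as a match only if both residues agree and are not gaps.
    matches = sum(1 for a, b in zip(row_a, row_b) if a == b and a != "-")
    return 100.0 * matches / len(row_a)


# Final 40 residues of m117.pep and of ORF 117.ng (S/G is the only difference).
m117_tail = "QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRL"
g117_tail = "QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRL"

print(f"{percent_identity(m117_tail, g117_tail):.1f}% identity over {len(m117_tail)} aa")
# prints "97.5% identity over 40 aa"
```
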

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 415>: 
a117.seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 
51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 
101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 
151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 
201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 
251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCG CACCCTGCAA 
301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

401 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

451 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

701 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

751 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 

801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

851 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 

901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 

951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

1401 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

1451 GCGAAATTTC CAACCGCGCC ATCCAAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 117.a>: 

a117.pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRKMLLA MVTDIRVVLI KLAMRTRTLQ 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 
401 EIITAKEGHP SVNWLYEGWV KSNKAIGKIR AYIRQQNADT VREEGRVQLD 

451 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSETT IVKQSKIKKG GKNGVLIDGE DGLMTTLAKC CKPAPPDDIV 

551 GFVTRDRGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVTDLP 

651 RVLASLGDVK GVLSVTRL* 



m117/a117 98.0% identity in 490 aa overlap 



10 20 30 

ml 17 . pep VKLKKYN VH FE VAGRPKH I YS I YKKMVKKKL 

: I I I II : It I I I I M I I I I I I II I I I I I M 
all7 EKYREIALLLDEKRTERLEYIENFLNILRTELKKYNIHFEVAGRPKHIYS I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

ml!7 .pep S FDGLFDI RAVRI LVDT VPECYTTLGI VHSLWQP I PGE FDDYI AN PKGNGYKS LHTVI VG 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I M I I I I I I I I 
all7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPI PGE FDDYIAN PKGNGYKS LHTVI VG 

210 220 230 240 250 260 



100 110 120 130 140 150 

mll7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I t I I I I I I II I I I I I I M I I I I I II I I I I I I I I I I I I I I I I II I I t I I I I II | M I I I I 
all7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 



160 170 180 190 200 210 

mll7 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
I I I I I I t I M I I I M I I I I II I I I I i I II M \ I I M I I I I t I I ! I I I I I I I I I I I I M I T 
all7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 



220 230 240 250 260 270 

ml 17 . pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
I I I I I I I I I I ! I I I I I I I M I f I I I I I I M I I I I I M I I I I I I I I I II I I I I I 1 I I I I i I 
a 1 1 7 PLSTPLENGQRVE I ITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

mll7 . pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQICACGTLNEPPPVPVSE 
I I I I I I I I I I I I I I I I I ! I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I! I I I I I I I I I 
all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 

340 350 360 370 380 390 

mll7 . pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I : I I I I I : I I I I E M I : I 
all 7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540 550 560 



400 410 420 430 440 450 

mll7 . pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
1:1111111 I I I I I I M I I I I I I It II I I I I I I I I I I M M I I I I I i I | I | | I | | | | M 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 



460 470 480 490 

ml 17 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I I I II I I I I I II I I I : I II I M I I I I I I M I I M I I I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
630 640 650 660 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 417>: 

g117-1.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 



951 cgactACATC GCCAACCCCA AAGgcaACGg ttATAAAAGt TTGCACACCG 
1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 
1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 
1101 AGAAGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 
1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 
1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 
1251 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGTGGC AAAACCGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 418; ORF 117-1.ng>: 

g117-1.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPEE KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SGKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 

601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 419>: 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 






51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAAAGCC 

651 CGAAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

851 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTATCGGC TTCGTTACCC GCGAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 420; ORF 117-1>: 

m117-1.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WLLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQKPEKY REIALLLDEK RTERLEYIEN FLNILRGELK 

251 KYNVHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIIG FVTRERGISV HRKTCPSFQH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLASLGDVKG VLSVTRL* 

m117-1/g117-1 98.2% identity in 737 aa overlap 

10 20 30 40 50 60 

mll7-l .pep MT A I S P I QDTQS AT LQEL REW FDS YC AAL PDNDKNLIG T AWLL AQE H Y P AD AAT P YG E P L 
I M I I I I I I I I I I I II I I ! I I I I I I I I I I 1 i I I I I I I I I I I I I I I I I I I I I I II I I I I I 
gll7-l MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 

10 20 30 40 50 60 

70 80 90 100 110 120 
m117-1.pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
MMIIIMII I I I i ! I I I I ! I I I I ! I [ I ] I | | | I | M I 1 I I I |[ I | | M | | [ , , | | | | 
PDHFLGAAQMVDELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 HO 120 

130 140 150 160 170 180 

ml!7-l . pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
, 111 ,Mllll| ll'lllfi|llillMI||M|||H|||||,, M | tI1 ,, M||1)|| 

gll7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
130 140 150 160 170 180 

iin ^ I 90 200 210 220 230 240 

mll7-l . pep RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIEN 
' II IIIMIIMI IMHI IIMill llllll Nl;i[ || MMIill I IMIIIIMI I 
g!17-l RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYIEN 
190 200 210 220 230 240 

250 260 270 280 290 300 

mll7-l. P ep FLNILRGELKKYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
^ n I I : i I I ! I ! I 1 f : J I I I I I I I I I I I I I ( I f I I I I I | I I I I ! I I I I | I i I I 1 1 1 1 ! 1 1 , , 

gll7-l FLDILRTELKKYNIHFEVAGRPKHI YSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

mll7-l.pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
, 1 1 f 1 1 1 I I I I I 1 I I I I I I I I I I I I ( I M t I I t I I I f If I : I I I I I I l l l 1 1 M j , j j , . , 

gll7-l TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEEKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 1 7 - 1 . pep AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 
1 1 1 I ' I f ' I I I I ! I I I I t I I t ! I I f f I I I M i I I I I I I I I I I ! M I I I I I j I j I ! I j 1 f t 
AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDT I YVLTPH 
370 380 390 400 410 420 

430 44 0 450 460 470 480 

mll7-l.pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
„ n , 11 1 1 1 I I I 1 ! I I I I I I I i I I I t 1 M I I I H I I | 1 M 1 1 1 ( 1 1 1 1 1 1 1 , , , 1 j j , 1 , , , , , 

gll7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
430 44 ° 450 460 470 480 

, 490 500 510 520 530 540 

II1117-1 . pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
1 1 1 I ' » I I I I I : I I I H I I I I I I I M I I I I I I t I I I I I I I I I I I I I I I I 1 I I I j I j j M i 
VNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
490 500 510 520 530 540 

550 560 570 580 590 600 

mll7-l.pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
, " J I) I I II I I I I I I I I I I I M I | I I J I I I I I I I l j I j I I I I J I I I j I j 1 : j 1 I 1 1 1 1 j 

gll7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLIDGED 
550 560 570 580 590 600 

610 6 20 630 640 650 660 

mll7-l.pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
1n , HIM lllllllllllll I I I I I I I I I I I I I I I I I | | : | | |) | | | | | | | | | j ,,, r ,, , 

gll7-l GLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPSFR^ 

610 620 630 640 650 660 

670 680 690 700 710 720 

mll7-l . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
''''NIMiilllllllllllllllllllllllllitlllMMMMilllMMin 
QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
670 680 690 700 710 720 

730 

ml 17-1. pep VLASLGD VKG VLS VTRLX 
I I I : I I I ! I I II I I I I I I 
gll7~l VLAGLGDVKGVLS VTRLX 

730 

m117-1/RelA 

sp|P55133|RELA_VIBSS GTP PYROPHOSPHOKINASE (ATP:GTP 3'-PYROPHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi 1537617 (U13769) ppGpp synthetase I [Vibrio sp.] Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities = 288/685 (42%), Positives = 432/685 (63%), Gaps = 31/685 (4%) 

Query: 74 LDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L+D + ALL+ GYD +E+T+ LV+GV+++ ++ ++ S 
Sbjct: 68 LSMDADTLIAALLYPLVEGGCYSTD ALKEEYSGTI LHLVQG VEQMC AI S QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E +1 

Sbjct: 122 AEETAQAAQVDNIRRMLLSMVDDFRCVVIKLAERICNLREVKDQPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW++ED FR+Q P+ Y+ + IA L E+R +R +YI +F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQ 310 

N+ EV GRPKHIYSI++KM KK L FD LFD+RAVRI+ + + +CY LG+VH+ + + 
Sbjct: 241 ASNIRAEVQGRPKHIYSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGVVHTKYR 300 

Query: 311 PIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEG- 369 

+P EFDDY+ANPK NGY+S+HTV++GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPHGKVLSLP 427 

G SAY++KI WLR+LL W+E M++SG ++ ++++F+D +Y TP G V+ LP 
Sbjct: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG — EML DE L RS QVFD DR V Y A FT P KG D W DL P 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL — TPKPNLQELAENLGYKKP 543 

G+V S++A K+ A+ R+Q+ D G+ L+ +L K+ T K A+ K P 

Sbjct: 479 LGFVTSSRARAKVHAWFRKQDRDKNIIAGKEILEAELVKIHATLKDAQYYAAKRFNVKSP 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G++ N+ I +N+P + + K S+ KK ++ V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASW 654 

+++G D LMT LA+CC+P P DDI GFVT+ RGISVHR C + L HAPE+++D W 
Sbjct: 599 WEGVDNLMTHLARCCQPIPGDDIQGFVTQGRGISVHRMDCEQLEELRHHAPERIIDTVW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ— SRDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V ++++ + + M F LE+ 
Sbjct: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQVNDLPRVLASLGDVKGVLSVTRL 737 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRIEQVKDVAEAKRL 742 
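
The summary line of the comparison above reports each statistic as a fraction and a whole-number percentage. The following minimal sketch parses such a line and recomputes the percentages. The helper names `parse_alignment_stats` and `truncated_percent` are hypothetical, and the use of truncation rather than rounding is an assumption, adopted only because it reproduces all three figures reported above (31/685 is about 4.5%, reported as 4%).

```python
import re


def parse_alignment_stats(line: str) -> dict:
    """Extract 'Name = num/den (pct%)' triples from a summary line."""
    pattern = r"(\w+) = (\d+)/(\d+) \((\d+)%\)"
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in re.findall(pattern, line)
    }


def truncated_percent(num: int, den: int) -> int:
    """Whole-number percentage, truncated (assumed convention, see lead-in)."""
    return 100 * num // den


summary = "Identities = 288/685 (42%), Positives = 432/685 (63%), Gaps = 31/685 (4%)"
stats = parse_alignment_stats(summary)

for name, (num, den, reported) in stats.items():
    print(name, f"{num}/{den}", truncated_percent(num, den), reported)
```

Each recomputed value can be compared with the reported one to catch transcription errors in the fractions.
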

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 421>: 

a117-1.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCAC CGCGCTGCCG AACAACGATA 

101 AAAAACTTGT CTTAGCCGCC CGTTCGCTGG CGGAAGCACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAACCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTTAATA TCCTGCGTAC GGAACTCAAA 
751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGGTTGTTCG 
851 ACATCCGCGC CGTGCGGATT CTGGTTGATA CCGTCCCGGA GTGTTACACC 

901 ACACTGGGCA TTGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAACCCGA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCGCACT GGCGTTACAA 

1101 AGAGGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTAC 

1151 GCCAACTTTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACAGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGTGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 CATTGTCGGC TTCGTTACCC GCGATCGCGG CATTTCGGTA CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTCGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTTACCGA 

2151 CCTCCCACGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 



This corresponds to the amino acid sequence <SEQ ID 422; ORF 117-1.a>: 

a117-1.pep 



1 MTAISPIQDT 
51 DAATPYGEPL 
101 SERCNSTVAE 
151 VTDIRVVLIK 
201 WQLKWQLEDL 
251 KYNIHFEVAG 
301 TLGIVHSLWQ 
351 DMHQFNEFGV 
401 LAAAFKTELF 
451 KVEGQIVPLS 
501 YIRQQNADTV 
551 GQGEISNRAI 
601 GLMTTLAKCC 
651 DASWAALQEG 
701 DLEASMRFTL 


a117-1/m117-1 97.7% identity in 737 aa overlap 

10 20 30 40 50 60 

mll7-l.pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
MMHItiMIIIIIIIIMinici, f:M]:I: :| I I : | | | [| | | | | I | | i | 
all7-l MTAISPIQDTQSATLQELREWFDSYCTALPNNDKK^ 

50 60 



10 20 30 40 



mll7-l .pep 
all7-l 



70 80 90 100 no 120 

PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVOK 
1 1 1 1 1 1 1 ' 1 1 } ' > I ! I I j ! I 1 M I I I I I f | | | | I | | ] | 1 | ! I i I | I | | | M I I I I II I It 
PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 HO 120 

130 140 150 160 170 IRQ 

mll7-l.pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
1 1 ! IIIMI,l| l'lllllillllllllll|||||IIIM|||M|||||||| Mf , | IM | 

all7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTL^ 

130 140 150 160 170 180 



ml 17-1 .pep 



190 200 210 220 230 240 

RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIEN 



all7-l RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALL^ 



190 200 210 220 



230 240 
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250 260 270 280 290 300 

ml 17-1 .pep FLNILRGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
I I I I I I I I I II I : I I I I I I I I I I I I I I I I I I 1 I I I I I I I I i I I I I I I II I I I I I I I I I I 
all7-l FLNILRTELKKYNIHFEVAGRPKHIYSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 17-1 .pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 1 
a 1 1 7 - 1 TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVI VGPEDKGVEVQI RTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 1 7 - 1 . pep AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDT I YVLTPH 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
all7-l AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDT I YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

mll7-l.pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I II I I I I II I I I 1 I I I I I I I I I I I I I I I I I I I II I I I I I I I I J I I I I I I I I I 1 I I I 
a 1 17 - 1 GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQI VPLSTPLENGQRVEI ITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 1 7 - 1 . pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I f I I I I 1 I M I I I I I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
all7-l VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1 . pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
I I I I II 1 I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
all7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

m117-1.pep  GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG
            ||||||||||||||||||:|||||:|||||||||||||:|||||||||||||||||||||
a117-1      GLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG

610 620 630 640 650 660 

670 680 690 700 710 720 

m117-1.pep  QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||:||||
a117-1      QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR

670 680 690 700 710 720 

730 

m117-1.pep  VLASLGDVKGVLSVTRLX
            ||||||||||||||||||
a117-1      VLASLGDVKGVLSVTRLX

730 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

g118.seq

1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGCTG GAGAATGATT TAATcgaGGT TAGGAGAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGCGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAaCA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 424; ORF 118.ng>:

g118.pep

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK

51 YPYPMDIPRD IVIGIGTIID FLMVPNWELF EIKASPWLPD SVGIHERYER

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 
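
The correspondence between a DNA listing such as g118.seq and its peptide listing can be checked by translating the DNA in frame 1. The following is only a minimal sketch under stated assumptions: it uses the standard genetic code, treats any codon containing an ambiguity code as X, and the helper names clean_listing and translate are hypothetical rather than part of the original analysis.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_listing(text):
    """Drop position numbers and whitespace from a numbered sequence listing."""
    return "".join(ch for ch in text if ch.isalpha()).upper()


def translate(dna):
    """Translate frame 1; ambiguous codons become 'X' and stop codons '*'."""
    dna = clean_listing(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First row of g118.seq as printed (50 nt: 16 codons plus 2 spare bases).
row_1 = "1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA"
print(translate(row_1))  # MCEFKDFRRNIPCFEE
```

The printed residues agree with the start of g118.pep. Passing the full listing, position numbers included, should give the complete peptide ending in the stop symbol.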
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 425>:

m118.seq

1 ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 118>:

m118.pep

1 MCEFKDIIRN VPYFEGYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK

51 YPYPMDIPRY VVIGIGTIID FLMVPNWKLF EIKASPWLPD SVGIHERYER

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 118 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 118.ng)
from N. gonorrhoeae: 

m118/g118

                    10        20        30        40        50        60
m118.pep    MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY
            ||||||: ||:| || ||||||||||||||||||||||||||||||||:||||||||||
g118        MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m118.pep    VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD
            :||||||||||||||||:||||||||||||||||||||||||||||||||||||||||||
g118        IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD
                    70        80        90       100       110       120

m118.pep    YYNKKX
            ||||||
g118        YYNKKX
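
The identity figure reported for ORF 118 and ORF 118.ng can be reproduced by counting identical columns in the ungapped alignment. This is a hedged sketch rather than the program used for the original analysis: the function name percent_identity is hypothetical, the stop codon is left out of both sequences, and residues 61-62 of ORF 118 are taken as VV from the m118.seq DNA listing.

```python
def percent_identity(seq_a, seq_b):
    """Ungapped identity between two equal-length, pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return matches, len(seq_a), 100.0 * matches / len(seq_a)


# ORF 118 (m118.pep) and ORF 118.ng (g118.pep), stop codon omitted.
M118 = ("MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKK"
        "YPYPMDIPRYVVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYER"
        "FTTMLRYIFTEKDIVNVRFDYYNKK")
G118 = ("MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRK"
        "YPYPMDIPRDIVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYER"
        "FTTMLRYIFTEKDIVNVRFDYYNKK")

matches, length, identity = percent_identity(M118, G118)
print(f"{matches}/{length} = {identity:.1f}%")  # 116/125 = 92.8%
```

Nine of the 125 aligned positions differ, which gives the stated 92.8%. Alignments that contain gaps would need a proper alignment step first, which this sketch does not attempt.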

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 427>:

a118.seq

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 428; ORF 118.a>:

a118.pep

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK

51 YPYPMDIPRD IVIGIGTIID FLMVPNWELF EIKASPWLPD SVGIHERYER

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

m118/a118 93.6% identity in 125 aa overlap

                    10        20        30        40        50        60
m118.pep    MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY
            ||||||: ||:| || |||||||||||||||||||||||||||||||||||||||||||
a118        MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRD
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m118.pep    VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD
            :||||||||||||||||:||||||||||||||||||||||||||||||||||||||||||
a118        IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD
                    70        80        90       100       110       120

m118.pep    YYNKKX
            ||||||
a118        YYNKKX



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 

g120.seq

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAAGGCTACC CCAATCCGCC GTGCTGCACT

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG

201 TTTCGAATCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTGCCTACT 

251 ATAAAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGTCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGCC TGAATAAGGC GGGTACGGGA AAATACAGCA Taggcggcgt 

501 gGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATACGGTAA 

551 CGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGAcgaCG GCAAAACCTA TACGCTGAAG CTCAAATCGG TGCAGATCAA

651 CGGACAGGCC GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

g120.pep

1 MMKTFKNIFS AAILSAALPC AYAARLPQSA VLHYSGSYGI PATMTFERSG

51 NAYKIVSTIK VPLYNIRFES GGTVVGNTLH PAYYKDIRRG KLYAEAKFAD

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS

151 VGGLNKAGTG KYSIGGVETE VVKYRVRRGD DTVTYFFAPS LNNIPAQIGY

201 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 431>:

m120.seq

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT
251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCcAA ATTCGCCGAC
301 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA
551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 
601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 
651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

m120.pep

1 MMKTFKNIFS AAILSAALPC AYAAGLPQSA VLXYSGSYGI PATMTFERSG

51 NAYKIVSTIK VPLYNIRFES GGTVVGNTLH PTYYRDIRRG KLYAEAKFAD

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS

151 VGGLNKAGTG KYSIGGVETE VVKYRVRRGD DAVMYFFAPS LNNIPAQIGY

201 TDDGKTYTLK LKSVQINGQA AKP 



Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
from N. gonorrhoeae: 

ml20/gl20 

10 20 30 40 50 60 

m!20.pep MMKTFKNIFSAAIIiSATUjPCAYAAGLPQSAVLXYSGSYGIPATMTFERSGNAYKIVSTTK" 

IIINIIIIIIIIIIMIilllll Mlllll I I | f 1 i f 1 I I i I I I I 1 I I 1 f i {] t I I j 

9120 MMKTFKNIFSAAI LSAALPCAYAARLPQSAVLHYSGSYGIPATMTFERSGNAYKIVSTIK 

10 20 30 40 50 60 



ml20 .pep 
g!20 



70 80 90 100 no 120 

VPLYNI RFESGGTWGNTLHPTYYRDI RRGKLYAEAKFADGS VTYGKAGES KTEOS PKAM 

NMIIIIMIiMMIIIihlhlillMMMIIillMIIIMIIIMIIIMiM 

VPLYNIRFESGGTVVGNTLHPAYYKDIRRGKLYAEAJCFADGSVTYGKAGESKTEQSPKAM 
70 80 90 100 . HQ 120 



"0 140 150 160 170 180 

ml20.pep DLFTIAWQLAANDAKXjPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 



gl20 



DLFTIAWQLAANDAKLPPGLKITNGKKLYSVGGI^KAGTGKYSIGGVETEWKYRVRRGD 
130 140 150 160 170 180 



190 200 210 220 

ml20 .pep DAVMYFFAPS LNNI PAQ I G YTDDGKTYTLKLKS VQ I NGQAAKP 

l ; f M I M 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M M M i I f 1 M 1 M I M 

9120 DTVTYFFAPSLNNI PAQI G YTDDGKTYTLKLKS VQ INGQAAKPX 

190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 433>:

a120.seq

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 434; ORF 120.a>: 
a120.pep

1 MMKTFKNIFS AAILSAALPC AYAAGLPQSA VLHYSGSYGI PATMTFERSG

51 NAYKIVSTIK VPLYNIRFES GGTVVGNTLH PTYYRDIRRG KLYAEAKFAD

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS

151 VGGLNKAGTG KYSIGGVETE VVKYRVRRGD DAVMYFFAPS LNNIPAQIGY

201 TDDGKTYTLK LKSVQINGQA AKP* 

m120/a120 99.6% identity in 223 aa overlap

                    10        20        30        40        50        60
m120.pep    MMKTFKNIFSAAILSAALPCAYAAGLPQSAVLXYSGSYGIPATMTFERSGNAYKIVSTIK
            |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
a120        MMKTFKNIFSAAILSAALPCAYAAGLPQSAVLHYSGSYGIPATMTFERSGNAYKIVSTIK
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m120.pep    VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a120        VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m120.pep    DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a120        DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD
                   130       140       150       160       170       180

                   190       200       210       220
m120.pep    DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX
            ||||||||||||||||||||||||||||||||||||||||||||
a120        DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX
                   190       200       210       220



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 435>:

g121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG
51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG
101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG
151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC
201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA
251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA
301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT
351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA
401 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT
451 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT
501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG
551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca
601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA
651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC
701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc
751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct
801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG
851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG
951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg
1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA
1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG
1101 A

This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>:

g121.pep

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL
51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ
101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF
151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA
201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL
251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV
301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK
351 ATGASKPCIL GAGYYY*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 437>:

m121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG
51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG
101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG
151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG

1101 A 

This corresponds to the amino acid sequence <SEQ ID 438; ORF 121>:

ml21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from N. meningitidis menA with menB
ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF 121.ng)
from N. gonorrhoeae:

ml21/gl21 



10 20 30 40 50 60 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFT PYPGRLRRQLLDLODTGADEL 
'"If llllllMIIIIIIMCMIIIMMiMIIIIIM I I I I : I M I I I I I ■ I I I 
METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFT PYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 HO 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
N I I = I I I I I I II I I I I I I I | | | | | | | | | ! M I I II I I I I i I I I I | | M I I I M I II I | 
HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 !40 150 160 170 180 

190 200 210 220 230 240 

XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
1 : M I II I I I I I : i I j | M | | If I I I I I I || : I I I II 1 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 2 60 270 280 290 30 0 

GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
lim,:MI ""l'HMIIIMIIIIIIIMI II II Ml II I! I II If | | mil 

GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 



ml21.pep 
g!21 



gl21 



ml21 .pep 



ml21.pep 
gl21 

ml21 . pep 
gl21 
310 320 330 340 350 360

ml 21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I \ I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 
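
The comparatively low identity reported for ORF 121 seems to follow from the undetermined stretch in m121.pep. The listing contains 83 X residues, and outside them the two peptides differ at 14 positions, so 269 of 366 columns match (73.5%) if every X is counted as a mismatch. The sketch below illustrates the two ways of counting on residues 111-130 of the listed peptides. The function name identity_counts is hypothetical, and the treatment of X is an assumption about how the figure was derived, not something the text states.

```python
def identity_counts(seq_a, seq_b, unknown="X"):
    """Return (matches, columns, columns_without_unknowns) for aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = list(zip(seq_a.upper(), seq_b.upper()))
    # An unknown residue never counts as a match, even against another unknown.
    matches = sum(1 for a, b in pairs if a == b and a != unknown)
    known = sum(1 for a, b in pairs if unknown not in (a, b))
    return matches, len(pairs), known


# Residues 111-130 of ORF 121 (m121.pep) and ORF 121.ng (g121.pep).
m121_fragment = "SIQLADLPLLAxxxxxxxxx"
g121_fragment = "SIQLADLPLLAELTRIFTVG"

matches, columns, known = identity_counts(m121_fragment, g121_fragment)
print(f"unknowns as mismatches: {100 * matches / columns:.1f}%")  # 55.0%
print(f"unknowns excluded:      {100 * matches / known:.1f}%")    # 100.0%
```

On this fragment the two conventions give 55.0% and 100.0%, which shows how strongly an undetermined region can depress an identity figure.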



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 439>: 

a121.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG

1101 A 

This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>:

al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I M I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I ! I I I I I : M I I I I I I I I I I 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I M 11 I I I I t I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I : I I : I I I I I I I I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
130 140 150 160 170 180

mi 2 1 . pep axxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



al2l 



al21 



AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 
13 ° 140 150 160 170 180 



190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: M M I I M It : I I I M M i I M M M I I I I M M M I 

3121 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQ°HPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21. pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I ' I M I : I I I I I I M I I M | ] | | | M I I I I II I I I I I M I I I I II I I | M | | | | j | n 
GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml21.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHPCATGASKPCIL 
MllililllMIIMIICMMIIMIM I I I : I I I I : j I I I j ] I I I i I M ! | | | | I 
a!21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 2 1 . pep XAGYYYX 
MINI 

a!21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 441>:

m121-1.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG



This corresponds to the amino acid sequence <SEQ ID 442; ORF 121-1>:

m121-1.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK

351 ATGASKPCIL XAGYYY*

m121-1/g121 95.6% identity in 366 aa overlap

                    10        20        30        40        50        60
m121-1.pep  METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL
            ||||||||||||||||||||||:||||||||||||||||||| ||||:||||||||:|||
g121        METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL
                    10        20        30        40        50        60

                    70        80        90       100       110       120
m121-1.pep  HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL
            ||||:|||||||||||||||||||||||| ||||||||||||||||||||||||||||||
g121        HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL
                    70        80        90       100       110       120

                   130       140       150       160       170       180
m121-1.pep  AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA
            || ||||||||||||||||||||||||||||||||||:||||:||||||||||||||| |
g121        AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGIANISVLPPGA
                   130       140       150       160       170       180

                   190       200       210       220       230       240
m121-1.pep  PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST
            ||||||||||||||||||||||||||||||||||||||||||| ||||||||:|||||||
g121        PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST
                   190       200       210       220       230       240

                   250       260       270       280       290       300
m121-1.pep  GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV
            ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
g121        GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV
                   250       260       270       280       290       300

                   310       320       330       340       350       360
m121-1.pep  LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL
            |||||||||||||||||||:||||||||||| ||||||||||||||||||||||||||||
g121        LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL
                   310       320       330       340       350       360

m121-1.pep  XAGYYYX
             ||||||
g121        GAGYYYX



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 443>: 

a121-1.seq

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 444; ORF 121-1.a>:

a121-1.pep

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL
51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ
101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF
151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA
201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

m121-1/a121-1 96.4% identity in 366 aa overlap

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
1 1 I I I I I i I II I I I 1 I I I j ! I I f | | M I ! I I j I | | | j | I I j I | j | | j : | | | |; | | , t | | | 
METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 



al21-l 



70 80 90 100 HO 120 

ml21-l.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
IMCiillli IMIIIIIII llllll HUM Mini IIMi Mll:i|:!|||il II 
a!21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l.pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
, I'' H II I II III I II ill! ||f Mint I |||[: M || till II! ill til ill I! 

al21-l AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 
130 1^0 150 160 170 180 

190 200 210 220 230 240 

ml2 1 - 1 . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

101 , nmilllllllllll IIMIfllMIIIIIIIMIHIMI IIIIMIIMI 

al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGWILPQLLDRLLAHPYFAQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l.pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
HMIIIilMMI III! MlilliIMM | M I! I I II I I I I I I I I I I I I I I I I I I I I 
GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGS PHKATGASKPCIL 
IMIIIIIIIIIIIIIIIIillllliiiiii lll:III|:l!llll]||!l|||||ll! 
LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 
310 320 330 340 350 360 



al21-l 



al21 



ml21-l .pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



g122.seq

1 ATGGCTTTAC TGAGCATCCG CAAGCTGCAC
51 CGCCATCCAA TCCTTAGACT TGGACTTGGA
101 TGCTGGGCCC gTccggctgc ggCAAATCCA
151 GGTTTGGAGC CGCACCAagg cgGCAGCATC
201 ATTCggcAAA GACGTTTCCT GGCAAACCGC
251 tctttcaaag taacgAactg Tttgcccaca
301 ttcttAggcC CGGTAAagga aCAAAAcCgc
351 gCAAGCCGGC AAactGttgg aacgcgTCGG
401 CCTATCCGCG CGAACTTTCC GGCGGTCAGA
451 CGCGCCCTGT GCCTGAATCC GGAAGTCATC
501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT
551 TCGCCCGCGA AGGGATGAGT ATGCTCATCG
601 GCAGGCAAAG TTGCCGACCG CATCGTCTTT
651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC
701 CCCGCCAATT TCTGGCAGGT ATGGACTACT


This corresponds to the amino acid sequence <SEQ ID 446; ORF 122.ng>: 
g122.pep

1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN

51 GLEPHQGGSI VMDGVGEFGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI

101 FLGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF

201 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 



1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTTG GCGAAAACAC 

51 TATTTTGCGC GGCATCGATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAkGCATG GTGTTTCAAC AATACAAyCT CTTTCCGCAC 

301 AAAACCGCCT TGGAAAACGT AATGGAAGGA CCGGTTGCCG TACAgGGCAA

351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG

401 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG
451 CAGCAGCGCG TCGGCATTGC CCGCGCATTG GCGATTCAGC CTGAACTGAT
501 GCTGTTTGAC GAACCGACTT CCGCGCTCGA TCCTGAATTG GTGCAAGATG 
551 TTTTGGATmC CATGAAGGAA TTGGCGCAAG AAGGCTGGAC CATGGTTGTC 
601 GTTACGCATG AAATCAAGTT CGCCTTAGAA GTGGCAACCA CCGwCGTCGT
651 GATGGACrGC GGCGTTATTG TCGAACAAGG CAGCCCGCAA GATTTGTTCG 
701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 
751 ACCAAGATTT GA 



This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

m122.pep

1 VVMIKIRNIH KTFGENTILR GIDLDVCKGQ VVVILGPSGS GKTTFLRCLN

51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMVV

201 VTHEIKFALE VATTXVVMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS

251 TKI* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
from N. gonorrhoeae: 

ml22/g!22 



m!22 . seq 



ml22 .pep 



10 20 30 40 50 60 

WMI KI RNIHKTFGENTI LRGIDLDVCKGQVWI LGPSGSGKTTFLRCLNALEMPEDGQ I 



9122 



MALIjSIRKLHKQYGSVTAIQSLDLDLEKGEVIVLLGPSGCGKSTLLRCVNGLEPHQGGSI 
10 20 30 40 50 60 




ml22 .pep 



70 80 90 100 110 120 

EFDNERPLKIDFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 



gi22 





70 80 90 100 110 



ml22 .pep 



130 140 150 160 170 180 

QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 



gi22 




120 130 140 150 160 170 



ml2 2 -pep 



190 200 210 220 230 240 

VQDVLDXMKE LAQEGWTMVVVTHEIKFALEVATTXVVMDXGVIVEQGSPQDLFDHPKHER 



gi22 
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350 



180 190 200 210 



220 230 



250 

m122.pep TRRFLSQIQSTKIX

= 1 = 11 =

g122 ARQFLAGMDYX
240 



a122.seq

1 GTTGTCATGA TTAAAATCCG CAATATCCAT
51 CATTTTGCGC GGCATCAATT TGGATGTGTG
101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA
151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC
201 GCTGAAAATC GATTTTTCTA AAAAACCAAG
251 TGCGCCGCAA ATCAGGCATG GTGTTTCAAC
301 AAAACCGCCT TGGAAAACGT GATGGAAGGA
351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT
401 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT
451 CAGCAGCGCG TCGGCATTGC CCGAGCATTG
501 GTTGTTTGAC GAACCCACTT CCGCGCTTGA
551 TGTTGAACGC CATGAAGGAA TTGGCGCGGG
601 GTTACCCACG AAATCAAGTT CGCGCTGGAA
651 GATGGACGGC GGCGTTATCG TAGAGCAGGG
701 ACCACCCCAA ACACGAACGG ACGCGGAGAT
751 ACCAAGATTT GA





This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>:

a122.pep

1 VVMIKIRNIH KTFGKNTILR GINLDVCKGQ VVVILGPSGS GKTTFLRCLN
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSGM VFQQYNLFPH
101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ
151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMVV
201 VTHEIKFALE VATTVVVMDG GVIVEQGSPK ELFDHPKHER TRRFLSQIQS
251 TKI* 

m122/a122 96.0% identity in 253 aa overlap

10 20 30 40 50 60

m122.pep VVMIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQI
a122 TRRFLSQIQSTKIX

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 451>:

g122-1.seq

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG

451 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

751 ATTTGA

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1.ng>:

g122-1.pep

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVVVT

201 HEIKFTLEVA TNVVVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK

251 I* 
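
The correspondence between a nucleotide listing such as g122-1.seq and its peptide listing can be checked by translating the DNA with the standard genetic code. The following is a minimal Python sketch offered for illustration only; it is not part of the original analysis. The helper name `translate` is hypothetical, and reporting any codon that contains a symbol other than A, C, G or T as X is an assumption, chosen because it agrees with the X residues printed in the listings of this section.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in reading frame 1.

    '*' marks a stop codon; 'X' marks a codon containing an
    ambiguity symbol (assumed convention, see lead-in).
    """
    # Drop the spaces and digits that are only layout in the printed listing.
    seq = "".join(ch for ch in dna.upper() if ch.isalpha())
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 nucleotides of g122-1.seq as printed above.
print(translate("ATGATTAAAA TCCGCAATAT CCATAAGACC"))  # MIKIRNIHKT
```

The same function applied to the final six nucleotides of the listing, ATTTGA, yields I followed by the stop symbol, in agreement with the last line of g122-1.pep.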



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 453>: 

m122-1.seq

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACTATTTT 

51 GCGCGGCATC GATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTAATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGCGTCGGCA TTGCCCGCGC ATTGGCGATT CAGCCTGAAC TGATGCTGTT 

501 TGACGAACCG ACTTCCGCGC TCGATCCTGA ATTGGTGCAA GATGTTTTGG 

551 ATACCATGAA GGAATTGGCG CAAGAAGGCT GGACCATGGT TGTCGTTACG 

601 CATGAAATCA AGTTCGCCTT AGAAGTGGCA ACCACCGTCG TCGTGATGGA 

651 CGGCGGCGTT ATTGTCGAAC AAGGCAGCCC GCAAGATTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>:

m122-1.pep

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVVVT

201 HEIKFALEVA TTVVVMDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK
251 I* 

m122-1/g122-1 94.8% identity in 251 aa overlap
m122-1.pep MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF
g122-1 REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ
130 140 150 160 170 180 

190 200 210 220 230 240 

m122-1.pep DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPQDLFDHPKHERTR
g122-1 DVLDAMKELAREGWTMVVVTHEIKFTLEVATNVVVMDGGVIVEQGSPKELFDHLKHERTR
190 200 210 220 230 240 

250 

m122-1.pep  RFLSQIQSTKIX
            ||||||||:|||
g122-1      RFLSQIQSAKIX
250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 455>:

a122-1.seq

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTCGGCAAAA ATACCATTTT 

51 GCGCGGCATC AATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTGATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG

451 CGCGTCGGCA TTGCCCGAGC ATTGGCGATT CAGCCCGAGC TGATGTTGTT

501 TGACGAACCC ACTTCCGCGC TTGACCCCGA GTTGGTGCAA GACGTGTTGA 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCGCGCT GGAAGTTGCC ACGACCGTTG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA

This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-1.a>:

a122-1.pep

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMVVVT
201 HEIKFALEVA TTVVVMDGGV IVEQGSPKEL FDHPKHERTR RFLSQIQSTK
251 I*

a122-1/m122-1 97.2% identity in 251 aa overlap

10 20 30 40 50 60

a122-1.pep MIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 457>:

g125.seq

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT
251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA

401 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC
451 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT
501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 
551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 
601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 
651 CCTGACGGCA ACGCTCGCCT ATACGCTGAC GGGCTGCTGG ATGTATGCCT 
701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAACCGACGT GGCGAAAATC 
751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 
801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 
851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 
901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 
951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 459>: 

m125.seq

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA

401 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC

451 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT
501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG
601 CTTGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT

651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG
701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG
751 CTGGGCGCAr GTTTGgGTGC GGCAGGCATT TTGGCGGTCG TCCTCTCCAC
801 CGTTACCACA ACGTTTCTCG ATGCCTATTC CGCCGGCGCG AGTGCGAACA
851 ACATTTCCGC GCGTTTTGCG GAAACACCCG TCGCTGTCrG CGTTACCCTG

901 ATCGGCACGG TACTTGCCGT CATGCTGCCC GTTACCGAAT ATGAAAACTT

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCGGgC GGTTTTGATT

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA

This corresponds to the amino acid sequence <SEQ ID 460; ORF 125>: 

m125.pep

1 MSGNASSPSS SSAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA

101 VMIYAGATVS SALGKVLWDG ESFVWWALAN GALIVLWLVF GARKTGGLKT

151 VSMLLMLLAV LWLSAEVFST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL
251 LGAXLGAAGI LAVVLSTVTT TFLDAYSAGA SANNISARFA ETPVAVXVTL

301 IGTVLAVMLP VTEYENFLLL IGSVFAPMAG GFDCRLFRLE TA*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng)
from N. gonorrhoeae:

m125/g125

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 461>:

a125.seq

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 
151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

401 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC

451 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT

501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 

601 CTGGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 

651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 

701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG

751 CTGGGCGCAG GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC

801 CGTTACCACC ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA
851 ATATTTCCGC CAAACTTTCG GAAATACCCA TCGCCGTTGC CGTCGCCGTT
901 GTCGGCACAC TGCTTGCCGT CCTCCTGCCC GTTACCGAAT ATGAAAACTT
951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCG.GC GGTTTTGATT

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 462; ORF 125.a>: 

a125.pep

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA

101 VMIYAGATVS SALGKVLWDG ESFVWWALAN GALIVLWLVF GARKTGGLKT

151 VSMLLMLLAV LWLSAEVFST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL

251 LGAGLGAAGI LAVVLSTVTT TFLDAYSAGV SANNISAKLS EIPIAVAVAV

301 VGTLLAVLLP VTEYENFLLL IGSVFAPMAX GFDCRLFRLE TA* 



m125/a125 95.6% identity in 342 aa overlap

10 20 30 40 50 60 

m125.pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA
j I [ I I I ! I I I I : I I ! I M I I I i M I I M I I ! I M M I II I I I I I! I I I I I II I M I I I ! I 
a125 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA

10 20 30 40 50 60 

70 80 90 100 110 120 

m125.pep AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 1 I I I I II I I 1 I 
a125 AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG

70 80 90 100 110 120 

130 140 150 160 170 180 

m125.pep ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD

I It I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I M I I j I I | I | | | 
a125 ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD

130 140 150 160 170 180 

190 200 210 220 230 240 

m125.pep GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF

I M II I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I M I I I I I I I I I I I I M I I I M I 
a125 GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF

190 200 210 220 230 240 

250 260 270 280 290 300 

m125.pep TGETDVAKILLGAXLGAAGILAVVLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL
I I I I I I I I I I I I I I I I ! I i I I I I I I I 1 I I I I I I I I I II : i M I I I I : : : I I : II 1 : : 
a125 TGETDVAKILLGAGLGAAGILAVVLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV

250 260 270 280 290 300 

310 320 330 340 

m125.pep IGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX
: I I : I I I : I I I I I I I I I I I I I I I 11 I I I I I I I I I I f I I I I I I 
a125 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX
310 320 330 340
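
Residue 254 of ORF 125 is printed as X because the corresponding codon in m125.seq, rGT at nucleotides 760-762, contains the ambiguity symbol r. Under the standard IUPAC nucleotide codes, r denotes A or G. The Python sketch below, given for illustration only, enumerates the residues that such a codon could encode; the function name `possible_residues` is hypothetical and the code is not part of the original analysis.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes (standard definitions).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def possible_residues(codon: str) -> set:
    """Return every amino acid that an ambiguous codon could encode."""
    # Expand each position into the bases it may stand for.
    options = [IUPAC[base] for base in codon.upper()]
    # Translate every concrete codon in the Cartesian product.
    return {CODON_TABLE["".join(bases)] for bases in product(*options)}


print(sorted(possible_residues("rGT")))  # ['G', 'S']
```

For rGT the candidates are glycine and serine; the a125 sequence shows G at the same position, which is one of the two possibilities.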



g126.seq

1 AtgccgtcTG AAaccCcaaa
51 GTCCGACAAC CATACCAAAG
101 CTTTCCCTTC GCGGCTGCTg
151 ATCCTCAAAC AATCCGTCCG
201 GCTGCGCCGC ACGGGATGCG
251 CGCTGCTTCA AGAAACCGGC
301 CAAAGCGTGC AGGAAGCGGT
351 TGAAACCGAT TGGATAAAAT
401 AGCCGGACGT GTTCCAACTC
451 GGCTTCAAAG TGCTGCCTTA
501 CCTGCTCGAT GCGGGCTGTC
551 GCACGGGTTT GGGGGCGGTT
601 CGCCTGCCCG ACACGCCGCT
651 CCAAGCGGCA CAAGTGATGG
701 CCGCCGTTTC CCGCAGCGGC
751 CTCGCCGTCG AATCCGGACG
801 GCGAACCAAA GCCCAAGCCA
851 ATTCGGCGGA ATATTGA



This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

g126.pep

1 MPSETPKARR RLSDGIASDN HTKESIMLTL YGETFPSRLL LGTAAYPTPE

51 ILKQSVRTAR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA APIGTGLGAV HAYALKILRE 

201 RLPDTPLIID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVESGRLAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 465>:

m126.seq (partial)

1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTATACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTGCCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGATTACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CACGGCAGGC TGCCAAAGCG
251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC

301 GATTGGATAA AATTGGAACT CATCGGAGAT GACGACACCT TGCAGCCGGA
351 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT CCTGATTAAA GACGGCTTCA
401 AAGTGCTGCC TTATTGCACC GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 
451 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 
501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 
551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 
601 GCACAAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA ATACTGCCGT 
651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 
701 TCGAATCCGG ACGGCTGGCA TTTGAAGCCG GACCGGTCGA AGCACGCGAC
751 AAAGCGCAAG CCAGCACGCC GACAGTCGGA CAACCGTTTT GGCATTCGGC
801 GGAATATTGA

This corresponds to the amino acid sequence <SEQ ID 466; ORF 126>:

m126.pep (partial)

1 . . HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 

51 RAGSGGEAHG QGFWSLLQET GVPVLPNTAG CQSVQEAVTT AQMAREVFET 
101 DWIKLELIGD DDTLQPDVFQ LVEAAEILIK DGFKVLPYCT EDLIACRRLL

151 DAGCQALMPW AAPIGTGLGA VHAYALNVLR ERLPDTPLII DAGLGLPSQA

201 AQVMEWGFDG VLLNTAVSRS GDPVNMARAF ALAVESGRLA FEAGPVEARD 

251 KAQASTPTVG QPFWHSAEY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from N. gonorrhoeae: 

m126/g126

10 20 30 40 

m126.pep HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQ

-HI i ' IMI I I II II I I I M I II I I II || I I M - M : 
g126 MPSETPKARRRLSDGIASDNHTKESIMLTLYGETFPSRLLLGTAAYPTPEILKQSVRTAR

10 20 30 40 50 60 

50 60 70 80 90 100 

m126.pep PAMITVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD

MM I 111 I hi II I M I M i I M 1 1 1 M M M 1 1 M M 1 1 1 1 M 1 1 1 M I I II M M I 

g126 PAMITVSLRRTGCGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD

70 80 90 100 110 120 

110 120 130 140 150 160 

m126.pep WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWA

MM 1 1 M M 1 1 1 1 1 1 ! E 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 |j i 1 1 1 1 ! M [ 

g126 WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWA

130 140 150 160 170 180 

170 180 190 200 210 220 

m126.pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG

I I II I lil M Ml 1 1- I I III 1 1 Mil IN I I II I 1 1 IM M ■ 1 1 1 II II I M I MM I 

g126 APIGTGLGAVHAYALKILRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG

190 200 210 220 230 240 

230 240 250 260 270 

m126.pep DPVNMARAFALAVESGRLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX

I I f I 1 I M 1 1 1 1 1 1 1 1! I M I f 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 ! I i 1 1 1 1 1 1 1 i 

g126 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>: 

a126.seq

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG ATTGGATTAA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

401 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC

451 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA

751 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 






This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>:

a126.pep

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE 

251 ARDKAQASTP TVGQPFWHSA EY*

m126/a126 98.1% identity in 269 aa overlap

10 20 30 40 50 

m126.pep HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE
! I I U M II I I I : I II I I II I I I I I I i I I I I I I I I I :: I I : I I II I I M I I 1 I III 
a126 LLIHYTKEPIMLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGE

10 20 30 40 50 60 

60 70 80 90 100 110 

m126.pep AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD
M M II If ! M I M I I I [ M f t II I I I I I I I 1 | M I M I I I I I j I j I i 1 I M f | | M | 1 | 
a126 AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD

70 80 90 100 110 120 

120 130 140 150 160 170 

m126.pep VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN

1 I 1 M II I I i I I I I I I I M I I I i I I | | | | | | | | M 1 M I I I M i I I I I i | | | | | | | | | | | 
a126 VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN

130 140 150 160 170 180 

180 190 200 210 220 230 

m126.pep VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG

M I II I I I If I I I I I I I II I I I I I I I I | M | | | | | (j | M || | | | | | | | M I M I I M I I 
a126 VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG

190 200 210 220 230 240 

240 250 260 270 

m126.pep RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX

I I I I I I I I I I I I II I I I II I I I I I I I I I I I M I 
a126 RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 469>: 

g126-1.seq

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 GGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-1.ng>:

g126-1.pep

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE
51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 471>:

m126-1.seq

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 472; ORF 126-1>:

m126-1.pep

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 



m126-1/g126-1 96.9% identity in 262 aa overlap

10 20 30 40 50 60 

m126-1.pep MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL
I I I I I I i I I I I I I M I M I I I I I I i I I I I : : 1 I : I I I I I I I I I i : I I I I I I I I I I I I I I 
g126-1 MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL

10 20 30 40 50 60 



70 80 90 100 110 120 

m126-1.pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI
I I I I I I I I I I I I I I I E 1 I I I I I I I II I I I I f I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I 
g126-1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI

70 80 90 100 110 120 



130 140 150 160 170 180 

m126-1.pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP
1 I I I I I I I I I I I I I I I I I I I I I I I I I M I I i 11 I I I f I I I I I I I I I I I I ::! I I I I I I I I 
g126-1 LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALKILRERLPDTP

130 140 150 160 170 180 



190 200 210 220 230 240 

m126-1.pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE
I I I I I I I t I I I I I t I I I I I I I M I I I I I I I If I I I I I I I If I I I J I I I I I t t I I I I I I I I 
g126-1 LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE

190 200 210 220 230 240 



250 260 
m126-1.pep ARDKAQASTPTVGQPFWHSAEYX
II M I I I I I I 1 I I I I I I I I I I I 
g126-1 ARTKAQASTPTVGQPFWHSAEYX

250 260 
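
The percent identity reported for a pairwise alignment such as m126-1/g126-1 is, in essence, the fraction of aligned positions at which the two sequences carry the same residue. The Python sketch below illustrates that calculation on the first 50 residues of the two peptide listings above. It is an illustration only: the function name `percent_identity` and the treatment of gap columns are assumptions, and the conventions of the alignment program actually used for the figures in this document are not stated here.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    Columns in which both sequences have a gap are ignored (assumption).
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == gap and y == gap)]
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


# Residues 1-50 of m126-1.pep and g126-1.pep as listed above.
m126_1 = "MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE"
g126_1 = "MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGE"

print(f"{percent_identity(m126_1, g126_1):.1f}% identity in {len(m126_1)} aa overlap")
```

Over this 50-residue window the two sequences differ at five positions (30, 31, 34, 45 and 47), so the sketch reports 90.0% identity; the figure quoted in the alignment heading refers to the full 262-residue overlap.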



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 473>:

a126-1.seq

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATTAA ACTCGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 474; ORF 126-1.a>:

a126-1.pep

1 MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP

251 TVGQPFWHSA EY*

a126-1/m126-1 98.1% identity in 262 aa overlap

                    10        20        30        40        50        60
a126-1.pep  MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGEAHGQGFWSLL
            |||||:|||||||||||||||||||||||::||:|||||||||||| |||||||||||||
m126-1      MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL
                    10        20        30        40        50        60

                    70        80        90       100       110       120
a126-1.pep  QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m126-1      QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI
                    70        80        90       100       110       120

                   130       140       150       160       170       180
a126-1.pep  LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m126-1      LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP
                   130       140       150       160       170       180

                   190       200       210       220       230       240
a126-1.pep  LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
m126-1      LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE
                   190       200       210       220       230       240

                   250       260
a126-1.pep  ARDKAQASTPTVGQPFWHSAEYX
            |||||||||||||||||||||||
m126-1      ARDKAQASTPTVGQPFWHSAEYX
                   250       260



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 475>:

g127.seq

1 ATGGAAATAT GGAATATGTT GAACACTTGG CCCGATGCCG TCCCGATACG
51 CGCGGAGGCG GCCGAATCCG TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC
101 GCGCCCTTCT GTTGAATATC CACTTCAGAC GGCATCCGGA TTTCGGCATC
151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT
201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATT CAAACGCTGG
251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACAAAAGAA
301 CTGATTATGT GTCTGTCGGG CAGTATTTTA aggtctGCCA CCCAGCAATA
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGOTcJ
401 ACATCAATCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG
451 GTCGGACAGC TTGCGGGAAC caccgtttct TTCCCCAACA GCCTGTTGTT
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA
551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC
601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT
651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG
701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC
751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA
801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC
851 CCGCCGgctC cgAAACACTT TAA



This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

g127.pep

1 MEIWNMLNTW PDAVPIRAEA AESVAAVAAL LLARALLLNI HFRRHPDFGI

51 ESKRRFLVAS RNITLLLVLF SLAFIWSAQI QTLALSMFAV AAAVVVATKE

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RVVDINLLNT LMMQVGPNPL

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL *



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 
m127.seq

1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

301 CTGATTATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTGG

401 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG
451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 
601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 
651 CCAACGGsAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 
701 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 
751 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 
801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 
851 CCGCCGGCTC CGAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

m127.pep

1 MEIWNMLDTW LGAVPIRAEA VESVAAVAAL LLARALLLNI HFKRHPDFGI

51 ESKRRFLVAS RNITLLLVLF SLAFIWSAQI QTLALSMFAV AAAVVVATKE

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RVVDINLLNT LMMQVGPNPL

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 

m127/g127

                 10        20        30        40        50        60
m127.pep MEIWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS



gl27 




10 20 30 40 50 60 



                 70        80        90       100       110       120
m127.pep RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG



g127
70 80 90 100



110 120 



                130       140       150       160       170       180
m127.pep DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g127     DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY
                130       140       150       160       170       180

                190       200       210       220       230       240
m127.pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT
         ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
g127     VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRYLENVQAEKLFITPAARPRVT
                190       200       210       220       230       240

                250       260       270       280       290
m127.pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX
         |||||||||||||||||||||||||||||||||||||||||||||||||||
g127     RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX
                250       260       270       280       290



a127.seq

1 ATGGAAATAT
51 TGCGGAGGCG
101 GCGCCCTTCT
151 GAAAGCAAGC
201 GGTGCTGTTT
251 CTTTGTCGAT
301 CTGATTATGT
351 CTCGGTCGGC
401 ACATCAACCT
451 GTCGGACAGC
501 GAGCCACCCC
551 CGGTCGAAAT
601 CGTCTGAAAG
651 CCAACGGCAT
701 CCGCCAAACC
751 ATCATCGTCC
801 GGCGGTTATG
851 CCGCCGGCTC



This corresponds to the amino acid sequence <SEQ ID 480; ORF 127.a>:

a127.pep

1 MEIWNMLDTW LGAVPIRAEA VESVAVVAAL LLARALLLNI HFKRHPDFGI

51 ESKRRFLVAS RNITLLLVLF SLAFIWSAQI QTLALSMFAV AAAVVVATKE

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RVVDINLLNT LMMQVGPNPL

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC

201 RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT RVPYDDKAYR

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL *

m127/a127 98.6% identity in 290 aa overlap

                 10        20        30        40        50        60
m127.pep MEIWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS
         ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
a127     MEIWNMLDTWLGAVPIRAEAVESVAVVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS
                 10        20        30        40        50        60

                 70        80        90       100       110       120
m127.pep RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a127     RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG
                 70        80        90       100       110       120

130 140 150 160 170 180

m127.pep DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a127     DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY

130 140 150 160 170 180 

190 200 210 220 230 240 

m127.pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT

I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I I I II I I I I I I I II I I I I I I I M I I : I I I I 
al27 VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRHLENVQAEKLFITPAAKPRVT 

190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 
I I I I I I I I I I I I I I I M I II I I I I I I I I M I I I II I M I I I I : I I I I II I I 
al27 RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNYPAGSETLX 

250 260 270 280 290 
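
The identity figure quoted for this alignment is the fraction of aligned positions at which both sequences carry the same residue. A minimal sketch of that calculation in Python follows, using the final block of the alignment above as input. The function name `percent_identity` is hypothetical, and the sketch assumes an ungapped alignment of equal-length strings; it is not the program that produced the original figures.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of aligned positions with identical residues."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Final alignment block (positions 241 onward) of m127 and a127.
m127_tail = "RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX"
a127_tail = "RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNYPAGSETLX"

print(round(percent_identity(m127_tail, a127_tail), 1))  # one mismatch in 51 columns

# The reported 98.6% over 290 aa is consistent with 286 identical positions.
print(round(100.0 * 286 / 290, 1))
```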



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 481>:

g128.seq

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG

1451 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG

1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG

1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT

1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 

1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 

1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 

1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 

1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC

1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC
2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA

This corresponds to the amino acid sequence <SEQ ID 482; ORF 128.ng>:
g128.pep

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA

51 NTVERLTGIT ERVGRIWGVV SHLNSVVDTP ELRAVYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR QSGFDNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>:

m128.seq (partial)

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG
201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG
251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC
351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC
1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA
51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC
101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC
151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT
201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG
251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG
301 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG
351 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA
401 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA
451 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 
1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 484; ORF 128>:

m128.pep (partial)

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA
51 NTVEPLTGIT ERVGRIWGVV SHLNCVADTP ELRAVYNELM PEITVFFTEI
101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH*

//

1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV
51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL
101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV
151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL
201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI
251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT
301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

m128/g128

10 20 30 40 50 60

g128.pep MIDNALLHLGEEPRFNQIQTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT

I II II 1 1 Ml I II Ml IMIIIIMM II 1 1 I II 1 1 1 1 = I ! i I I 1 1 M 1 1 1 ! Mill

m128 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT
10 20 30 40 50 60

70 80 90 100 110 120

g128.pep ERVGRIWGVVSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA

IMMMMMMI I M M M M 1 1 1 1 M 1 1 II 1 1 1 II II 1 1 II I M 1 1 1 1 I M II II

m128 ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD
70 80 90 100 110 120



130 140 150 160 170 180 

g128.pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY

1 1 1 1 II 1 1 1 I M

m128 TLSPAQKTKLNH

130 

// 

340 350 360 

g128.pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG

M M I M M 1 1 1 1 1 1 I I II Mill 1 1 I

m128 YASEKLREAKYAFSETXVKKYFPVGXVLNG

10 20 30 



370 380 390 400 410 420 

g128.pep LFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWMNDYK

MM I I 1 1 I I I M M M 1 1 II 1 1 1 1 1 1 1 I II M M 1 1 M I II M M M II II I II I II

m128 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK

40 50 60 70 80 90 



430 440 450 460 470 480 

g128.pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV

MM Ml I Ml I M MM I MM II MM! II M I I M 1 1 M I M I M M M M I M M 

ml28 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

g128.pep SGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF

MINI I Ml I I I MM M I Ml I I II M IMIIII IIMM M IMIMI Ml

m128 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF

160 170 180 190 200 210



550 560 570 580 590 600 

g128.pep LVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY

III I > I I I I M I I I h I I M I Ml Ml I M MM I Mill M MM IMMMIM 

m128 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY

220 230 240 250 260 270 



610 620 630 640 650 660 

g128.pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRGREPS

IM M M M M Ml M MIMMMMM M MMMI M I I I : M I I I I I 1 1 I I I I

m128 SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS

280 290 300 310 320 330 



670 679
g128.pep IDALLRQSGFDNAAX

1 1 M I h 1 1 II I h

m128 IDALLRHSGFDNAVX
340

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 

a128.seq

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 
251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC
351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC
401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA
451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 
601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 
651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA
801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORF 128.a>: 

a128.pep

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS
651 AAESFKAFRG REPSIDALLR HSGFDNAA*

m128/a128 66.0% identity in 677 aa overlap

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I [ I I t | | | | | | | | | | | 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

m128.pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD
I II I I I I I I I I I It I : I I I ! I II : I I I I i M M I I I j I I I I I I I I I | | | I I ! I I I i I I I 
al28 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml28 . pep TLSPAQKTKLNH 

III II I I II I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml28.pep 

al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

m128.pep YASEKLREAKYAFSETXVKKYFPVGX

I I : I I I I I I I I I I I I I I I I I I I II
a128 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK

310 320 330 340 350 360 

160 170 180 190 200 210 

ml28 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
I I M I I I I I I I I I M I I I I I I I II I I M I I I I I II I I : I I I I I I I II M I I I I | | | | | 
al28 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I M I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I ! I I I I I I I I E I I I I I I I i I 
a128 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD

430 440 450 460 470 480 

280 290 300 310 320 330 

m128.pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ
I M I I II I I I I I I I I I I I 1 II I I I I I I I 11 I I I I I I I I I I I I I I I I I II II I I ! I I
a128 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ

490 500 510 520 530 540 

340 350 360 370 380 390 

ml28 .pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
Ml ME I I I I I i I M I I I I I I I M I I I I I I I I II I I : II I :: II I I I I I I MINI 
a128 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF

550 560 570 580 590 600
400 410 420 430 440 450

m128.pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG

i''' M: 11,1 11,1 'I I I MINI III || | in Ml MINIM Ml-llllll II
a128 AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG

610 620 630 640 650 660

460 470 
m128.pep REPSIDALLRHSGFDNAVX

I I M N II M II II I I M
a128 REPSIDALLRHSGFDNAAX

670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 487>:

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA
801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC
1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA

1401 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG
1451 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 488; ORF 128-1.ng>:

g128-1.pep (partial)

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA
51 NTVERLTGIT ERVGRIWGVV SHLNSVVDTP ELRAVYNELM PEITVFFTEI
101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA
151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA
201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG
251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL
301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET
351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT
401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG
451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 489>:

m128-1.seq



1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG
201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG
251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC
351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC
401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC
601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC
651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC
701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA
801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA
851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC
951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG
1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC
1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC
1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG
1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC
1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG
1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC
1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA
1401 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG
1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG
1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC
1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC
1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG
1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA
1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT
1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC
2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

m128-1.pep

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA

51 NTVEPLTGIT ERVGRIWGVV SHLNSVADTP ELRAVYNELM PEITVFFTEI

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG
251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET
351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG
451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAV* 



m128-1/g128-1 94.5% identity in 491 aa overlap

10 20 30 40 50 60 

g128-1.pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT

I IMMMM II I Ml M MM MIMIMM II MIMIIIMMMMM lllll 

m128-1 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT

10 20 30 40 50 60 



70 80 90 100 110 120 

g128-1.pep ERVGRIWGVVSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA

I ill! Mill Ml I ll : llllll I il I IMMi II I II I III I ll Ml iliMMII M 

m128-1 ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD
70 80 90 100 110 120
g128-1.pep
ml28-l 

gl28-l .pep 



130 140 150 



160 170 180 



ij 1 1 1 M 1 1 1 ■ 1 1 i 1 1 ii 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

130 140 15 ° 160 170 ieo 

190 200 210 220 230 240 



I I I I I I I I I I M ! I I I I I I I : I I I I j I I I I I I I I I I I I I I I I I I I I I I I 

m128-1 FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV

190 200 210 220 230 240 



gl28-l .pep 
ml28-l 

9128-1 .pep 



TRASELSDDGKFDHTAMTnDTT.aMITrN™„r T ~™_.' _ ' ' 1 ' ' ' ' 1 ' I I I I I I I I | | 



TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL
250 260 270 280 290 300 

310 320 330 340 350 360 

, I I I I I I I I I I M | | | | | | | | | : : : I I I I I I | I I I I I | I | I I I I I | | I 

m128-1 ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK

310 320 330 340 350 360 



370 380 390 400 410 

" "" I '" I " 1 1 1 : 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 

m128-1 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM

370 380 39 ° 400 410 420 

"1 111 1 1 l : MII llll! II II II ll.-lllllliiii lliiiiiiiiniiiiT 
m128-1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD

430 440 «° 460 470 480 

490 

g128-1.pep ELGVSGINGVK
llllllllll: 

m128-1 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ
490 500 S10 520 530 540 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 491>:

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG
201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG
251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC
351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC
401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA
451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT
601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC
651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC
701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA
801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA
851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC
951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG
1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC
1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC
1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG
1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC
1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG
1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC
1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA
1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG
1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG
1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC
1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC
1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG
1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG
1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT
1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC
2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA


This corresponds to the amino acid sequence <SEQ ID 492; ORF 128-1.a>:

a128-1.pep

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA
51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI
101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA
151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA
201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG
251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL
301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET
351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET
401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG
451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS
651 AAESFKAFRG REPSIDALLR HSGFDNAA*



ml28-l/al28-l 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

a128-1.pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT

MM II! Ili II I MM ii II MM II I 1 II I HUM I lllll MM II I il I >i I III 

m128-1 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT

10 20 30 40 50 60 



al28-l .pep 



ml28-l 



70 80 90 100 110 120 

ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD

II I ! 1 1 I II I M I II Ml MM I MM I I III I II M II M II M M M Mi I Ml II M 

ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD
70 80 90 100 110 120 



130 140 150 160 170 180 

a128-1.pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY

III M Ml I MMMMMIMI III MM I MM! II IMMMMIMM I III Ml 

m128-1 TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY

130 140 150 160 170 180 

190 200 210 220 230 240 

a128-1.pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV

I MIMIII MM IMIMIMI I IM I IMIMMMMMM Ml M M II MIMM 

m128-1 FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV

190 200 210 220 230 240 



al28-l .pep 



ml28-l 



250 260 270 280 290 300 

TRASELSDDG KFDNTANIDRTLENALQTAKLLGFKNYAE LS LATKMADTPEQVLNFLHDL 

II II I I MM i II I II 1 1 I IM III II MIMM Mill III M M J I II 1 1 MM IM 

TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELS LATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



310 320 330 340 3S0 360 

al2 8 - 1 . pep ARRAKP YAEKDLAE VKAFARE S LGLADLQ PWDLG YAG E KLRE AKYAFS ETE VKKYF PVG K 

Mill I III IMIIMI I MM MM II Ml IMilMIIMI I MM lllll MIMM 
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ARRAKPYAEKDLAEVKAPARESLNIADLQPWDLGYASEKIiREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 380 390 400 410 420 

Y^?^ m^P f ^^T? f G FTEKT VP VWHKD VRYFE LQQNGET I GG VYMDL YAREGKRGGAWM 
I ' I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | I t | j j j 

VLNGLPAQIKKLYGIGFTEKWPV^KDVRYFELQQNGETIGGVY^ 

370 380 390 400 410 42 0 

,™ , 430 440 450 4 60 470 480 

al28-l.pep ^YKGRRRFSDGTLQLPTAYLV^ 

llllmillllll| lillliiil|:|Mlhl!IIN!M! I i 1 1 1 1 1 1 1 1 1 1 Ml i i 

»128-1 NDYKGRRRFSDGTLQLPTAYX^VCNFAPPVGGREARLSiTOEILILFHE^ 

430 440 450 460 470 480 



ml2 8-l 

a!28-l .pep 
ml28-l 



al28-l .pep 
ml28-l 

al2 8-l.pep 
ml28-l 



ml28-l 



490 50 ° S10 520 530 540 

iiii ' 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j i j i j 1 1 j 

E LG VSG I NG VEWDA VEltP S QFMENFWE YNVLAQMS AHEETG V P LP KELFD^liAAKNFO 
490 500 510 520 530 540 

550 560 570 580 590 600 

l,ll| l<llll!llll|||||l||||||i||||j||||| ; ||, :: |,|,|| |f i j I j i j 

RGMFLVRQMEFAIjFDI^IYSEDDEGRLKNWQQVLDSVRKKVAVIQPPE 

550 560 570 580 590 600 

610 620 630 640 650 660 

1 1 1 1 1 ' 1 1 1 1 1 1 1 i I ! 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j j 1 1 1 1 1 1 1 1 , 

610 62 0 tin e-Ars 



610 620 630 640 



650 660 



670 679 

al2 8 - 1 . pep repsidallrhsgfdnaax 

IIIHIIIIIMIIII!: 
ml 2 8 - 1 RE PSIDALLRHSGFDNAVX 

670 

a128-1/P44573 

vastest: is ie) [Haem — «« — . 681 

Identities = 309/677 (45%), Positives = 415/677 (60%), Gaps = 4/677 (0%) 

Query: 4 NALLHliGEEPRFDQIKTEDIKPALQTXXXXXXXXXXXXXXXTHTGWANTVEPLTGITERV 63 

N LL++ P F QIK E I+PA++ H W N + PLT +R+ 

Sb]Ct: 5 NPIiLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVIiKQPHFTWENFILPLTETNDRL 64 

Query: 64 GRIWGWSHIJISVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRfktIKHSPEFDTLS 123 
R W VSHU, SV ++ ELR AY + P ++ + T +QQ LVM + +KNS Ep ^ 

Sb 3 et, 65 NRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYIALKMSAEFADYS 124 

Query: 124 HAQKTKIJJHDLRDFVLSGAELPPEQQAEIiAKLQTEGAQLSAKFSQNVIiDATDAFGlYFDD 183 

ov, ■ * , AQK + + LRDF LSG L E+Q ++ ++I(+++ FS NVLDAT + 

Sb D ct: 125 IAQKKAIENSLRDFELSGIGLSEEKQQRYGEIVARLSELNSQFSNKVLDATMGWEKLIEN 184 

Query: 184 AAPLAGIPEDALAMFAAAAQSEGKTGYKIGIjQIPHYIjAVIQYADNRKLREQIYRAYVTRA 243 

t ,„ A LAG+PE AL +A+S+G GY+ L+IP YL V+ Y +NR LRE++YRAY TRA 

Sb 3 Ct: 185 EAELAGLPESALQAAQQSAESKGLKGYRFTIjEIPSYLPvTITYCENRALREEMYRAYATRA 244 

Query: 244 SELSDD-GKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDLAR 302 
„v. . SE + GK+DN+ ++ L ++ AKLLGF Y ELSLATKMA+ P+OVL+FL T a 

Sbjct, 245 SEC^PNAGKVTONSKVMEEILTLRVEI^I^FNTYTELSIATKMAENPQQVLDFI^hZL 304 

Query: 303 ^PYAE^V^FARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGKVL 362 

RAKP EK+L E+K + + G+ +L PWD+G+ EK ++ YA ++ E++ VPP ^ 
Sb 3 ct: 305 RAKPQGEKELQELKGYCEKEFGVTELAPWDIGFYSEKQKQHLYAINDEEIjRPYFPENRVI 364 
Query: 363 NGLFAQIKKLYGIGFTE-KTVPVWHKDVRYFEL-QQNGETIGGVYMDLYAREGKRGGAWM 420
           +GLF  IK+++ I   E K V  WHKDVR+F+L  +N +  G  Y+DLYARE KRGGAWM
Sbjct: 365 SGLFELIKRIFNIRAVERKGVDTWHKDVRFFDLIDENDQLRGSFYLDLYAREHKRGGAWM 424

Query: 421 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEIXXXXXXXXXXXXXXXXQVD 480
           +D  GR+R  DG+++ P AYL CNF  P+G K A  +H+E+                Q+D
Sbjct: 425 DDCIGRKRKLDGSIETPVAYLTCNFNAPIGNKPALFTHNEVTTLFHEFGHGIHHMLTQID 484

Query: 481 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 540
              V+GINGV WDAVELPSQFMEN+ WE   LA +S H ETG PLPKE   ++L AKNFQ
Sbjct: 485 VSDVAGINGVPWDAVELPSQFMENWCWEEEALAFISGHYETGEPLPKEKLTQLLKAKNFQ 544

Query: 541 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 600
             MF++RQ+EF +FD  ++   D  +       L SV+ +VAV++  ++ R  +SF HIF
Sbjct: 545 AAMFILRQLEFGIFDFRLHHTFDAEKTNQILDTLKSVKSQVAVIKGVDWARAPHSFSHIF 604

Query: 601 XXXXXXXXXXXXWAEVLSADAYAAFEESDDV-AATGKRFWQEILAVGGSRSAAESFKAFR 659
                       WAEVLSADAY+ FEE       TGK F  EIL  GGS    E FK FR
Sbjct: 605 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664

Query: 660 GREPSIDALLRHSGFDN 676
           GREP +DALLRH G  N
Sbjct: 665 GREPQLDALLRHKGIMN 681





The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 

g129.seq 

1 ATGCTTTCAC CTCCTCGGCG TAAAACGGCG GCACATCAAT CAAGCCGTCT 

51 TTCATTTGCG TGCGGAAAAA ATGCGGCGTG TTGCCGTGAT CAAAATCAAT 

101 ATCGTGCAGC ATCCAGCCCA AATCGCGGTT TGCCTCGCTT TCCGATAACG 

151 CCGACGGCGG CAGCGGTTCA CCCTTATCCG CGCTTTCGCC ATTTGCCCTT 

201 TCAGGCTGCG GGCATAGGGG CGGAACAGGC GGCGGTCGAA TCCTGTTTCA 

251 TCCGGACAAA CGCGTTGGCA GTCGGAAAAT CCGGCCGGCC GTGTCAAATA 

301 ATGCGTTACT TTGGCCGGGT CTTGTCCTTT GTAAGCGGCG GTCTTTTTTT 

351 GCGCGCCATC CGCATCTGTT TGGGCGCATG GCAAACGGCG GCTGCCGTAC 

401 AATCAAAATG TTTGGCGATT TCATGCAGAC AGGCATCCGG ATGCCGCCCG 
451 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGCCGGT 
501 AACTTGA 
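
The listings in this section are printed as numbered rows of ten-residue blocks. As a minimal sketch of how such a listing could be joined back into one contiguous string for further analysis, the following Python helper (the name `parse_listing` is hypothetical, not part of the specification) keeps only runs of letters and the `*` stop symbol, assuming that position numbers, block spacing and the leading `..` of partial sequences carry no sequence information:

```python
import re


def parse_listing(text: str) -> str:
    """Join a numbered, space-blocked sequence listing into one string."""
    # Keep only runs of letters (and '*' for stop codons); this drops the
    # leading position numbers, the block spacing and any '..' markers.
    return "".join(re.findall(r"[A-Za-z*]+", text))


# Last two rows of g129.seq as printed above.
rows = """
451 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGCCGGT
501 AACTTGA
"""
tail = parse_listing(rows)
print(len(tail))  # 57
```

Because the position numbers are discarded rather than checked, the helper also tolerates rows whose numbers were split during extraction.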

This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

g129.pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFGRVLSF VSGGLFLRAI RICLGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

m129.seq (partial) 

1 ..TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 

m129.pep (partial) 

1 ..YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LVVGKFGRLC QIMRYFGRVL 

51 FFVSGGLFLR VIPICLSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 

101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 129 shows 79.1% identity over a 110 aa overlap with a predicted ORF (ORF 129.ng) 
from N. gonorrhoeae: 

m129/g129 



                                                10        20        30
m129.pep                                YLRFHYLPFQAAGIGTEQVAVKSCFIQINT
                                        | ||  ||||||||| || || ||||  |
g129      RDQNQYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA
          30        40        50        60        70        80

                  40        50        60        70        80        90
m129.pep  LVVGKFGRLCQIMRYFGRVLFFVSGGLFLRVIPICLSAXQMVAAVQSKCLAISCRXASGC
          | ||| || ||||||||||| ||||||||| | ||| | |  ||||||||||||| ||||
g129      LAVGKSGRPCQIMRYFGRVLSFVSGGLFLRAIRICLGAWQTAAAVQSKCLAISCRQASGC
          90       100       110       120       130       140

                 100       110
m129.pep  CPTYXAGFCLSDLTAFRPVTX
           ||| |||||||| |||||||
g129      RPTYRAGFCLSDLAAFRPVTX
         150       160

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 497>: 

a129.seq (partial) 

J »£™ C ° CT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129.a>: 

a129.pep (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LVVGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIPICLSA*Q MVAAVQSKCL AISCR*ASWC CPTY*AGFCL 

101 SDLTAFRPVT * 

m129/a129 98.2% identity in 110 aa overlap 

                  10        20        30        40        50        60
m129.pep  YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGRLCQIMRYFGRVLFFVSGGLFLR
          ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
a129      YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGQLCQIMRYFGRVLFFVSGGLFLR
                  10        20        30        40        50        60

                  70        80        90       100       110
m129.pep  VIPICLSAXQMVAAVQSKCLAISCRXASGCCPTYXAGFCLSDLTAFRPVTX
          |||||||||||||||||||||||||||| ||||||||||||||||||||||
a129      VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCLSDLTAFRPVTX
                  70        80        90       100       110



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 499>: 

g130.seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 A ^ A ° ACCC GCATCCAGCC TGTCGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

401 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 

451 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 

751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

g130.pep 

1 MKQLRDNKAQ GSALFTLVSG IVIVIAVLYF LIKLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 501>: 

m130.seq (partial) 

1 ..GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGgCAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

201 AAACGGGCGA TTACTTACAT GGCGAACAAA AGCGGCGGTT CTTTCCCGAA 

251 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 

301 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 

351 GCGGCAcCCC TGCGGTCGGC GTTGACGGTA AAAAAGTCTT CGAAGCAACC 

401 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATTCCCG GCATAGGCAA 

451 AAAAGACGAT TGGGCACCGC GTATCAAAAA AGGCAAAGAA ACCTTGCACA 

501 AACACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGgCAATGCA 

551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 

601 ATCCGGTGCA AAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

m130.pep (partial) 

1 ..GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQHALNGFN 
51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 
101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 

151 DDWAPRIKKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 
201 GAKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from N. gonorrhoeae: 

m130/g130 

10 20 30 

ml30 .pep GEQ I FGKI C I QCHAADSNVPNAPKLEHNGD 

I I I il I I I I I II M II III I I I MM Ml I 

gl3 0 DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 

40 50 60 70 80 89 

ml3 0 . pep XAPRI -QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAI TYMANKSGGS FPNPDEAAP 

llll II Ml i I i III i I I IIMII III M I I II M I II III Ml II I I I I I Ml I il I 

gl3 0 WAPRI AQGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 

110 120 130 140 150 160 
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90 100 110 120 130 140 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMIIIMIIIIIIIIMIIIIIMIIl? 

ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVTX^ 

170 180 19° 200 210 220 



gi3 0 



ml3 0 .pep 
9130 



150 160 I'O 180 190 200 

KroDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 

HI I III UNI NIMH I MM I Nil I 1 1 1 1 1 N N I M 1 1 1 NH II mn 

K^DWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKA^ 

230 240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 503>: 

a130.seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

^ I^^° GCAG CGGCTCGTTG GGCGATGTCG ATGCCACTAC GGAAGCAGCA 
J" GTATCCAGCC TGTCGGACAA TTGACGATGG GCGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGATT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGTAG 

401 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

451 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

?Sf r^IJ CAGGA ACAGCTTC ^ G CTCCTGCCGA TAGTGCAGCT CCGgSJUg 

551 cgaaggcaga agacaagggt gcggcagccc ctgcggtcgg cgttgacggt 
601 aaaaaagtct tcgaagcaac ctgtcaggtg tgccacggcg gttcgattcc 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 
7^ AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

^^ CAAAG GCGGCAATGG AGGTTTGAGC GATGACGAAG TCAAAGCGGC 
801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 504; ORF 130.a>: 

a130.pep 

1 MKQLRDNKAQ GSALFTLVSG IVIVIAVLYF LIKLAGSGSL GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 



m130/a130 97.6% identity in 206 aa overlap 

ral30.pep 10 20 30 

P GEQIFGKICIQCHAADSNVPNAPKLEHNGD 

DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSN^ 

50 60 7 ° 80 90 100 

ml30.p ep ^ PR l-Q G fDTLFQHA^ 

al30 WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSF^ 

11° 120 130 140 150 160 

90 100 110 120 130 un 

ml30 . pep A DNAASGTASAPADSAAPAEAKAEDKGAA-PAVGVDGKKVFEATCQVCHGGSIPGIPGir 

180 190 200 210 220 

Bl , n 150 I 60 170 180 190 ?nn 

ml30 . pep KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 
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al30 KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 505>: 

g132.seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

301 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

g132.pep 

1 MEAFKTLIWI INIISALAVI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 507>: 

m132.seq (partial) 

1 ATGGAACCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA... 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

m132.pep (partial) 

1 MEPFKTLIWI VNLISALAVF VLVLLQHGKG ADAGATFG... 
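
The relationship between a DNA listing and the peptide it "corresponds to" can be checked by translating the first reading frame. The sketch below is an illustration only, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and mapping any codon that contains a non-ACGT symbol to `X` is an assumption made here rather than a rule stated in the specification.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


# m132.seq (partial), 114 nucleotides, copied from the listing above.
m132_dna = (
    "ATGGAACCCTTCAAAACCTTAATTTGGATTGTTAATTTAATTTCCGCTTT"
    "GGCCGTCTTCGTGTTAGTATTGCTCCAACACGGCAAAGGCGCGGATGCCG"
    "GCGCGACTTTCGGA"
)
print(translate(m132_dna))  # MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG
```

The printed peptide agrees with the 38 residues given for ORF 132.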

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

m132/g132 

                  10        20        30
m132.pep  MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG
          || ||||||| | |||||| ||||||||||||||||||
g132      MEAFKTLIWIINIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS
                  10        20        30        40        50        60
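
The identity figures quoted for each pair can be reproduced from the aligned rows. As a rough sketch, assuming that identity is counted as the number of columns with the same residue divided by the number of columns in the overlap (gap columns included in the denominator), a hypothetical helper `percent_identity` could look like this:

```python
def percent_identity(top: str, bottom: str) -> float:
    """Percent of alignment columns holding the same residue in both rows."""
    if len(top) != len(bottom):
        raise ValueError("aligned rows must have equal length")
    if not top:
        raise ValueError("empty alignment")
    # A gap character never counts as a match.
    matches = sum(1 for a, b in zip(top, bottom) if a == b and a != "-")
    return 100.0 * matches / len(top)


m132 = "MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG"
g132 = "MEAFKTLIWIINIISALAVIVLVLLQHGKGADAGATFG"  # first 38 residues of g132.pep
print(f"{percent_identity(m132, g132):.1f}% identity in {len(m132)} aa overlap")
# 89.5% identity in 38 aa overlap
```

Here 34 of the 38 columns match, which gives the 89.5% stated for ORF 132 against ORF 132.ng.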

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

a132.seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

251 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

a132.pep 

1 MEAFKTLIWI VNIISALAVI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

m132/a132 92.1% identity in 38 aa overlap 

                  10        20        30
m132.pep  MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG
          || ||||||||| |||||| ||||||||||||||||||
a132      MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS
                  10        20        30        40        50        60



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 511>: 

g134.seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 
51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 
101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CGGTGAAAGG TAAGAAAACC 
151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTACAAAGAC CACACCGTCA 
251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCGATA 

401 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 
451 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 

501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATCCCGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGccgG CGAACTCACG 
751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 
851 TGCGCATGGT CGGGCCGGAC GAGCCGAAAT TTTCCGGATT TATCTTTAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATCG CCTTCTTGCG 
951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACATCATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

1251 CGGCGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCGCGGATT 
1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

1351 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 
1451 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 
1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ID 512; ORF 134.ng>: 

g134.pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVEDI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGIN NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSVVTFMSHD 
351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEVVTSR 

451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

m134.seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTAACCG CCGTGGACAG CGCATTAATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

401 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

451 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 

501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 

551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 

651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 

851 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 

951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 
1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

1351 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 
1451 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 
1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

m134.pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 
251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSVVTFMSHD 
351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 
401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEVVTSR 
451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 
501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from N. gonorrhoeae: 

m134/g134 

ml34 .pep 
gl34 



m!34 .pep 
gl34 

70 80 90 100 110 120 

130 140 150 160 170 180 



10 20 30 40 50 60 

MSQEI LDQVRRRRTFAI I SHPDAGKTTLTEKIiLLFSGAI QSAGTVKGKKTGKFATSDWME 

llllll MINI I! Illllllllll ill IMMI IIMIIMMI I M MIMIII ih 

MSQEILDQVRRRRTFAI I SHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 

70 80 90 100 110 120 

I EKQRGI SVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

Mill M IIMIMI IMMIIIMI III! Mill Ml IMIIM II III II I INI !i 

I E KQRGI S VASS VMQFDYKDHTVNLLDTPGHQDFS EDTYRVLTAVDSALMVI DAAKGVE A 
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9134 QTIKLLNVCRLRDTP I VTFMNKYDREVRDSLELLDEVEDI LQ IRCAPVIWP IGMGI^FKG 

130 140 150 160 170 iso 

ml34.pep ^"^^"F T T^^'T'^'^^'^^^^^^^T*^' ^ ^* ^jEQRFPLE I QOIjRX>EX EX/VQAASNEFN 

IIIMIIIIII| INIIIII ||||||:|||||||||||||||||,|||||,|||TT7 
9134 VYHILlTOEiyLFEAGGERLPHEFDIIKGIlTOPEIjEQRFPLEIQQLRDEIEL^ 

190 200 210 220 230 240 

250 260 270 280 290 Ton 

ml34 .pep LDE FIAGELTPVFFGSAINNFGIOEHJJSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

IIIMIMIIIIIIMIIIIIMMIMIIIIIMllliiihiM N m 

IjDEFLAGELTPVFFGSAINNFGIOEIIiNSLIDWAPAPKPRDATMRIWGPDEP^SGFI FK 
250 260 270 280 290 300 



9134 

ml34 -pep 
gl34 



310 320 330 340 350 36 n 

?? A_ = ^ 

IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSVA^FMS 

310 320 330 340 350 360 



ml34. P ep DI I GI P^GNIQIGDSFSEGEQIiAFTGI PFFAPELFRSVRI KNPLKI KQLQKGLQQLGEE 

' 11 1 III I llll MINIM llll Mill INN || 1 1| || || IN Hill I III Mill 

9134 DIiaiBNHCBnQIGDSFSEaEQIJ^ 

370 380 400 410 420 



«nl34 .pep ™Oy™f^ 

1 1 1 1 M I M 1 1 II 1 1 1 1 1 1 1 II I 1 1| 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 ini 

9134 GAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYG^AVFDSASIW^ 

430 440 450 460 470 480 



490 500 510 S20 530 

ml34 . pep AEF E^AGNLAIDAGGNIAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

M M M 1 1 II I II II I M 1 1 II 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



gl34 



■minimi I I I | M I I I I I I | | || I I I I I I I | I I | 

AEFEKANAGNLAIDAGGNIAYIJVPNRVNLGLTOERWPDIVFHETREHSVKI^ 
490 500 510 520 530 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 515>: 

a134.seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCAGG TGCGATTCAA AGCGCGGGTA CGGTAAAAGG oSSSSS 

151 GGCAAATTCG ccacctccga ctggatggac atcgagaagc AGCGCGG^AT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTATaSc CACACCg£c~I 

251 ACCTTTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA SSScSJ 

301 GTTTTGACCG CCGTCGATAG TGCCTTGATG GTCATCGACG CGgSSSS 

351 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCAATA 

401 CGCCGATTGT TACGTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

451 CTGGAATTGC TGGACGAAGT GGAAAACATC CTGCAAATCC GCTGCGCGCC 

501 CGTAACCTGG CCGATCGGCA TGGGCAAAAA CTTCAAAGGC GTGTACCACA 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCTtSS 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCCGAAT TGgSJSS 

651 CTTTCCGTTA GAAATACAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAGTTCAAT CTCGACGAAT TCCTCGCCGG C^CTcicG 

751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGAATGGG CGCCCGCGCC GAAACCACGC SSgISS 

851 TGCGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGAT* ^^cSS 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTaJtc SS£££ 
This corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 

a134.pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSVVTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLTFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEVVTSR 

451 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

m134/a134 98.9% identity in 531 aa overlap 

10 20 30 40 50 60 

ml34 . pep MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 
I M I I I I I M M I I I I I I I I I I I I I I I I II I I M It I I I I I 11 I i I M I I I I II I I I i I : 
al34 MSQEILDQVRRRRTFAI ISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml34 , pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
I I I I I I I I I I I I I I I I I ! I I I I I I I I I I 1 I II I M I I M I M I I II I I I I II I I I I I I M 
al34 IEKQRGI SVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVI DAAKGVEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m!34 . pep QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKNFKG 
I I I I I I I I I I I I : I I I I I I I I II I I M I 1 I I I I I I I I I I I I : I I I M I I I I I I I I ! I I I i 
al34 QTIKLLNVCRLRNTPIVT FMNKYDREVRDS LELLDEVENI LQIRCAPVTW PIGMGKNFKG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml34 .pep VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 
I I I I I II I I I I I I 11 I I I I I I I M I i I I I i II M I II I I I I I I I I M I M I I | | | | | | || 
al34 VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

190 200 210 220 .230 240 

250 260 270 280 290 300 

ml34 . pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 
t M I I M I I I I I I I I I I I I I I I I I I I I 1 M I : i M I I I I I II 1 I I I I I I I 1 I I I I I M I I 
al34 LDEFLAGELTPVFFGSAINNFGIQEILNSLIEWAPAPKPRDATVRMVEPDEPKFSGFIFK 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml34 . pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 
I I I M I M I I I I I I I I I I I I I I I I I 1 I M I I I I I I I I I I I I I I I I I M M | I I | I || | M 
al34 IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

310 320 330 340 350 360 
951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAAATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TAACCTTCAT GTCCCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GTATCCCAAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGA 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTTCGC 

1201 ATCAAAAACC CGCTGAAAAT CAAGCAACTG CAAAAAGGTT TGCAACAGCT 

1251 TGGCGAAGAA GGTGCGGTGC AGGTGTTCAA ACCAATGAGC GGCGCGGATT 

1301 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

1351 CTTGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAACG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

1451 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCGGGCGG CAACCTCGCC 

1501 TACCTCGCCC CTAACCGCGT GAATCTGGGA CTCACGCAAG AACGCTGGCC 

1551 GGACATCGTG TTCCACGAAA CGCGCGAGCA TTCGGTCAAA CTTTAA 
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370 380 390 400 410 420 

ml34. P e P DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

''''''''''''''iiiiimii.-iiiiiiiiiiiiiiiiiiimihiihi,::,,,, 

al34 DIIGIPNHGNIQIGDSFSEGEQLTFTGIPFFAPELFRSVRIICNPLKIKQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

GAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 
,IIIIIMIIIi| IIMIIIIIIIIIIIIIII|||||||||| : ,|||| | || |||M ,|,| 

al34 GAVQVFKPMSGADLILGAVGVLQFEWTSRIANEYGVEAVFDNASIWSARWVSCDDKKKL 

430 440 4 50 460 470 480 

490 500 510 520 530 

ml34 .pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
H I MM Ml I III II I III I Ml I 1 || nil IN I IN mm || 1 1 |, M| 
al 3 4 AE FEKANAGNLAI DAGGNLAYLAPNRVNLGLTQERWPDI VFHETREHSVKLX 

49 0 500 510 520 530 



g135.seq 

1 ATGAAATACA 
51 TTCGGAcgGC 
101 TTGCCGCATT 
151 GCGGTTGCTG 
201 AATCGCCGAC 
251 AAGAATATAC 
301 CTGCTCAGCC 
351 CGGCGCACTT 
401 AAAACGACAC 
451 TTGAGTGCGC 
501 GACCGACATA 
551 CCGTACGGCT 
601 GCGGGCGGCT 
651 CAAAGCGGCA 
701 CCTCACTCAA 
751 GGCTCGTTTT 
801 GctggCGTTC 
851 cggaacacgc 



g135.pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQLAALHHA GHELVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAIP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLLTDT DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 519>: 

m135.seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC JJSxSSS 

401 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGgScTGCT 
501 GACCGACATA GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 
751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 
801 GCTGGCGTTC TATTCCGAAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 

851 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 
901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 
951 CAAGGCAACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

m135.pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHELVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLLTDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 
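
The correspondence between a DNA sequence and its amino acid sequence, as stated for each ORF in this listing, can be checked by conceptual translation. The following is a minimal sketch only, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and are not part of the original disclosure. Unrecognised codons (for example, those containing ambiguous bases) are reported as `X`, and translation stops at the first stop codon, written `*` as in the listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unreadable codon
        protein.append(aa)
        if aa == "*":  # stop codon ends the ORF
            break
    return "".join(protein)


# First 30 bases of the N. meningitidis sequence <SEQ ID 519>.
print(translate("ATGAAATACA AAAGAATCGT ATTTAAAGTC"))  # MKYKRIVFKV
```

The printed residues agree with the first ten residues of the amino acid sequence given for ORF 135.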

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 

ml35/gl35 

10 20 30 40 50 60 

m135.pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

11 I I II I I I ! I II I I I : I I I I I I I I I I I I i I I I I I I II II I I I I I I I I I i II I I I I I II 
g135 MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m135.pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g135     FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m135.pep SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGNPNS 

1 1 1 1 1 1 I h 1 1 1 M 1 1 1 1 1 1 M 1 1 1 M I 1 1 M M M ! M 1 1 i 1 1 1 I II 1 1 1 I M 1 1 1 1 1 1 

g135 SVLLQRRAIPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

m135.pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDA 

I M I I I MMI Ml MM MM M II I MM M I II IMMM I M I III I M M! M i: 

g135 NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDS 

190 200 210 220 230 240 

250 260 270 280 290 300 

m135.pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

I Ml II I II II I II MIMI Mill Ml Ml I I I IMIIIMIIIIMMIM 

g135 LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 

250 260 270 280 290 



310 320 330 340 350 360 
m135.pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 521>: 



a135.seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCra 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC AcJcgJcAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT CTcScCGGC 

201 25S2 2S2™2 S252522 SSSSS 



251 
301 



AATCGCCGAC AAACAGGCTT CC^CGCCGT CGGGCAGGGG SSS£J£ 
AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC SoSSStc 
CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCTrr 
fni ^^ GCACTT TCCGTGC *GC TGCAACGCCG CGCCGTCcS A^JcStc 
lit ^ ACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGActcI 
TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTCctSS 

501 gaccgacata gacggtcttt acaccggcaa cccgaacagc aatcSat? 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT «tcSS£g 
«i ^ G n^ GGCT CGGGTTCGG C AAACGGCACA GGCGGTATGC 

" x ^ AAAG ^ GCG ACGATTGCG * ccgagtccgg cgtaccggtc tmSJSSS 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CGGCAGATAA TCAGGCGGAC 
751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AG^gSSg 
801 GCTGGCGTTC TATTCCGAAA GCAGGGGCGG CGTTTATGTG GAcJS^Tr 
^ GAACACGC TTTGTCCGAA CAGGGAAAAA GCCTGCTgI? GtISSS 

90i gccggaatcg aagggcattt ttcccgtatg gacaccgtaa ccgtgwSE 

951 CAAGGCAACC AAACAGGCTT TGGGCAAAGG GCGAGTCCTG TtSgSSg 
1001 CCGCCGCCGA AGACCTGCTC AAATTGCGTA AGGCGAAAGG CGTCTtcSc 
Ho'! 1 SS ACTGGATTTC CATCACGCCC GAAAT ACG CC S££S££ 

This corresponds to the amino acid sequence <SEQ ID 522; ORF 135.a>: 

a135.pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA GHELVLVqqr 

51 AVAAG FGALG FKKRPVKIAD KQASAAVGQG LLMEEYTaS SSMfof 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV ISSgSS 

201 *rrTr^° ADLLVLT ' TDI DGLYTGNPNS NPDAVRLdS EHINhSS 

Si SKEKE ESSES SS5E SSSSS ££S£ 

^l 1 iSSSSS EIRLLLTEF* " ^SSS 

ml35/al35 98.4% identity in 369 aa overlap 

10 20 30 40 



20 30 40 50 go 

m l35. Pep ™= 

FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQIL^ 

° 80 90 100 no 120 

: i :: . PeP 

SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTD^ 

° 140 150 160 170 leo 

190 200 210 

LiDKIEHINHEIIEMAGGSGSANGT 

210 220 230 



a!35 



al35 



190 200 210 220 
250 260 270 280 290 300 

ml35 . pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 
I I I II : : I I I I I I I ! I I I I I I I I I I I I I I I I I I 1 I I : I I I I I I I M I I I I I I II I 1 I I I I 
al35 LAEAADNQADGSFFVPRAKGLRTQKQWIAFYSESRGGVYVDEGAEHALSEQGKSLLMSGI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml35.pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
I I I I I I i ! I i I I I I I I I ! I I I I I I I II I I I I ! I I M I I I I I I I I I I I M I I I I I I I I I I 
al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 

310 320 330 340 350 360 



370 

m135.pep EIRLLLTEFX 
         ||||||||||
a135     EIRLLLTEFX 

370 
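
The identity figures quoted for the alignments in this listing (for example, "98.4% identity in 369 aa overlap") can be illustrated by counting matching columns between two aligned sequences. The following is a hedged sketch rather than the program used to generate the alignments: the function name `percent_identity`, the use of `-` for gaps, and the choice to count every column with at least one residue as part of the overlap are all assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> tuple[int, int, float]:
    """Return (matches, compared columns, percent identity) for two aligned strings.

    Both strings must already be aligned to the same length, with '-' marking gaps.
    Columns in which both sequences carry a gap are skipped (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no columns to compare")
    return matches, compared, 100.0 * matches / compared


# Residues 1-60 of ORF 135 (m135.pep) and ORF 135.ng (g135.pep) as listed above.
m135 = "MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG"
g135 = "MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG"
matches, compared, pct = percent_identity(m135, g135)
print(f"{matches}/{compared} identical ({pct:.1f}%)")  # 58/60 identical (96.7%)
```

Over this 60-residue window the two sequences differ at positions 17 and 32; the figures quoted in the text are computed over the full overlap.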



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 523>: 

g136.seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

351 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

401 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 

451 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 

501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 

551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 

601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 

651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

g136.pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFVVNAAHCH 

101 HGVKQLFKRF IIGGFKPIGR HNVQTVKIGV APSVKIAAAL AVVVEPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>: 

m136.seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 
451 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 
551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 
651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 
701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 
m136.pep 

1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 

51 RQCIRQLGFQ FRQLAFCELQ TDSAVFLFVV NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVKISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 136 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from N. gonorrhoeae: 

ml36/gl36 



ml36 .pep 
gl36 



10 20 30 40 

METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPV 

Mllllllllllll NIIIMMli !| INIIIIflll IN 

MEIRFOTAFLRLVQMKTNASILTATRLVFPAAAARTGIVPAGFFPFPADGLRFTODRL 

10 20 30 40 50 60 

50 60 70 80 so inn 

ml36 .pep AyDIRQCIROLGFQFRQIAFCELQTDSAVFLFVVNTAOCHDGIKQLFKRFIIDGFKPIGR 
1 ' ' : 1 ! 1 ' : 1 : I i I I I I I I h I : I I I I | | | | : | : | | I : I I I i I I I I I I I I I I | I 
9136 AVDVCQRVRQFGRKFRQLAFGEIiOADNAVFLFVTOAAHCmGVKQLFJKFIIGG 

70 80 9 ° 100 no 120 

110 120 130 140 150 160 

ml36.pep "NIQTVKISIAPCVKIAAAV^ 

Hill ' I : : I N M = I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
9136 HNVQTVKIGVAPSVKIAAAIAVWEPQIGQLFIRHRGGC^ 

130 140 150 160 170 lao 

170 180 190 200 210 220 

FVAQHFGQPVERCQFVRPAQORRHKTLNLVATHRVALFAFGIQX 
190 200 210 220 



gl36 



230 240 
ml36.pep HFPFQMGFAPYYRRNAVX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

\l\ " m GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGaS? 

151 CGCCAATGCA taaggcaact cggattccag ttccgccagc tcgccttctg 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAgSS^ 

351 accatgtgta aaaatcgctg ccgccgtctt cgttttcata cagccJcIS 

401 tcgggcagtt ttttatccga catcgcggcg gttgtttcca tcgcSSJ? 

451 caaaaccagc cgttcgattt cggaacgttc ggcggcggta aatSgact 

501 cgtcgcccaa cacttcgggc agccagtcga gcggtgtcaa tttctccgg? 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCG^SgG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA ttttcctttt ccaatgggS 

701 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA 

751 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

801 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 528; ORF 136.a>: 

a136.pep 
1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFQ FRQLAFCELQ TDSAVVLFVV NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVKISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 



ml36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml36 . pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVDIRQCIRQLGFQ 
I I I M I I I I I I I I M I I I I I II I I I M I I II I I I I I I I : I I I I | I | | j | | | [ | | | u | | 
al36 METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRLVDDRLPVAVDIRQCIRQLGFQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

m136.pep FRQLAFCELQTDSAVFLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

1 1 1 1 1 m i ii 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 ii 1 1 r 1 1 1 1 1 1 1 1 1 m 1 1 m 1 1 1 1 1 1 1 1 1 

a136 FRQLAFCELQTDSAVVLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m136.pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a136     KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m136.pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFQMGFAPYYRR 

I I I I II ! I I II I I I It I I I II II I I II I I ! I I I I I I I I I I I I I I I I I | I | I I I I I I 1 
a136 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 



ml 3 6. pep NAVX 

al3 6 LNLNQDKATKPQTVQIVRQGEATPYWFKFNPLYRRNAVX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 529>: 

gl37 . seq 

1 ATGATTATCC ATCACcaaTT CGATCCCGTC CTCATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCT TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TGATTTTGGG 

201 CGGACGCTTG GGCTATGTCC TGTTTTACAA ATTCTCCGAC TACCTCGCCC 
251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCAGCC GCAAGCACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCGCCGCTC GTTCCGCTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTTA TCAACGGCGA ACTTTGGGGA 
451 CGCATTACCG ACATTAACGC ATTTTGGGCA ATGGGCTTCC CGCAAGCGCA 
501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 
551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 
601 GCCCTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TTTCCAAAAA 
651 ACCGCGCCCG ACCGGGCAGA CTGCCGCGCT TTTTCTCGGC GGCTACGGCG 
701 TGTTCCGCTT TATTGCCGAA TTTGCGCGCC AACCCGACGA CTATCTCGGG 
751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 
801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>: 

g137.pep 
1 MIIHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLGRRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGVVIAIW LFSRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RITDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFAV VWLFSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ WLSVPMIVLG IVGFVRFGMK KQH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 531>: 

m137.seq 

1 ATGATTACCC ATCCCCAATT CGATCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATACGG TTGTTCGGCC GCAAACACGG 
351 CATCGGCTTC CTCAAACTGA TGGATACGGT CGCACCGCTC GTTCCGCTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 
451 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 
501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 
551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 
601 GCACTTGAAG GCATCTGCCT GTTCACCGTC ATTTGGCTGT TCTCTAAAAA 
651 ACAGCGGTCG ACCGGACAAG TCGCCTCGCT CTTCCTCGGC GGCTACGGCA 
701 TATTCCGCTT CATTGCCGAA TTCGCACGCC AACCCGACGA CTATCTCGGG 
751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 
801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 

m137.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLGRRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGVVIAIR LFGRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFTV IWLFSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ WLSVPMIVLG IVGFVRFGMK KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
from N. gonorrhoeae: 



ml37/gl37 
ml3 7.pep 



MITHPQFDPyLISIGPLAWWYALSYILGF^ 

" 1 1 1 ' 1 1 ' 1 1 1 1 I I I I I I I I I I I 1 I I I I I I I I I I I I I I I 

9137 ""hhqfdpvlisigpiavrwyalsyilgfilftflgrrrIaqg^ 

10 20 30 40 50 60 

70 80 90 100 HO 120 

' 1 1 M 1 1 1 1 1 II I 1 1 II M 1 1 1 | 1 1 1 1 1 1 1 1 1 1 I I 1 1 I | 1 1 . 1 1 1 1 1 1 | 

ILGVILGGRLGYVLFYKFSDYIiAHPLDIFKVWEGGMSFHGGFLGVVIAIWLFSRKHG 

70 80 90 100 no 120 

130 140 150 160 170 180 

'"'HIMIIIMIIIIII IhlMIIIIIIIIIII-lllllliiniii 

LKLMDTVAPLVPLGLASGRIGNFINGELWGRITDINAFWAMGFPQAH^ 

130 140 150 160 170 iso 

190 200 210 220 

ml37.pe P AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 



ml3 7.pep 
gi37 

ml3 7.pep 
gl37 
I I I I II I I I I I I I I I I I I I I I i I I I I I I * 1 = I I I I I I I I I I = h I II I I I I : I I I I I I 
g137 AEWLQQYGMLPRHPSQLYQFALEGICLFAVVWLFSKKPRPTGQTAALFLGGYGVFRFIAE 

190 200 210 220 230 240 

250 260 270 280 

m137.pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
         ||||||||||||||||||||||||||||||||||||||||||||
g137     FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 

a137.seq 

1 ATGATTACCC ATCCCCAATT CGACCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCGGTC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCACCGCTC GTTCCACTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

451 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAC CTCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TCTCTAAAAA 

651 ACAGCGGCCG ACCGGACAAG TCGCCTCACT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTTGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a>: 

a137.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLGRRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGVVIAIW LFGRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFAV VWLFSKKQRP TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ WLSVPMIVLG IVGFVRFGMK KQH* 

ml37/al37 98.2% identity in 283 aa overlap 

10 20 30 40 50 60 

ml37 . pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I t I I I I I I M I I I i I i I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I It I t I I II I I I 
a!37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m137.pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGVVIAIRLFGRKHGIGF 
I I I I I I I I I I I I I M I 11 I I I I I II I II I I I I M I I I I I I I I I I I I I I I I I I I IE It I I 
a137 ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGVVIAIWLFGRKHGIGF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml37 . pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I I II I I I I I I I I I I I I I I I I I I I I I I I M I M I II I I I I I I I I I I I I I I j I II I I I I I I 
a137 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml37 . pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
I i I I I I I M II I I II I I I I I M I I I I 1 I : I : I I II I I I I I I I I I I I I I I I I I I I I I I | | 
a137 AEWLQQYGMLPRHPSQLYQFALEGICLFAVVWLFSKKQRPTGQVASLFLGGYGIFRFIAE 
220 230 240 



250 260 270 280 

m137.pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
         ||||||||||||||||||||||||||||||||||||||||||||
a137     FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 535>: 

.l 1 rrttrnTZn ™* CA ™ TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

n «, ^ AAGCACTG CCTTACAtcc gc cgGTTTTC CGGTTCGGTC GCCGTCATCA 

HI rfrnl^n ^ AACGCGATG ^CGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC OGoSSSS 

25i gcgaatttgt ccaaggaatg cgcgttaccg acaaagagac StoSJSS? 
301 gtcgaaatgg tattgggcgg gcacgtcaac aaggaaatcg totSJSS 

351 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGCGGGCGC GACGACCATT 
401 wll^, GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 
451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 
501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 
601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAtatcgc 
651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 
701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCcgccgtc aACGGTGTGA AAGCCACGCA 
801 CATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGGTCGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORF 138.ng>: 



gl3 8 .pep 

1 MEFENIISAA DKARIIAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
^Zrn^° IHPVIVH ^ G PQINAMLEKV GKKGEFVQGM RVTDKETMDI 



51 
101 



KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LVDTPEQNSV 
151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 
Hi *£S£^ U ™ 1 * WH ^TGNLLTKL TPKRIDGlS £££25 
251 KIASAVEAAV NGVKATHI ID GRLP NALLLE I FTDAGIGSM Ttr rpn^ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

xi nl^ 0 ^ 0 AAAACA ™ TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

nn^ CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

Hi SSEK CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCaSHS 

rlnn G lZ G CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

Hi ^^Zf TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

111 GAAGAAACTT TTGATCGATA CGCCCGAACA GA^SgSg 

GA ^ CGGAC AGGTCGGT AC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

HI ^^ laKSA GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

lei ^^ AAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACG^cSaaIc 

751 SSSSf ACTGATTGCC GACGGCACGC TGTATGGCGG iSSSSSS 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 

80! TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA A^cSaS 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 

m138.pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
51 RDVVLLKLVG IHPVIVHGGG PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPVVAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRLPNALLLE IFTDAGIGSM ILGGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml38/gl38 

10 20 30 40 50 60 

MESENI I SAADKARI LAEALPYIRRFSGSVAVI KYGGNAMTEPALKEGFARDWLLKLVG 

II II Ml M I 1 1 1 I ! M II Ml IN I M 1 1 1 II I IMIMI M II 1 1 1 1 1 1 1 1 1 1 1 : ! I 

ME FENI I SAADKARI LAEALP YI RRFSGS VAV I KYGGNAMTE PALKEGFARDWLLKLVG 
10 20 30 40 50 60 



ml38 .pep 
gl38 



70 80 90 100 110 120 

ml3 8 . pep I HPVI VHGGGPQ I NAMLEKVGKKGE FVQGMRVTDKEAMD I VEMVLGGHVNKE I VSMINTY 

Ml I I M I I M 1 1 1 I I I Ml MM M I M II M I MIMI III II II Ml M I M ! I 1 1 

gl3 8 IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKETMDI VEMVLGGHVNKE I VSMINTY 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml3 8 . pep GGHAVGVSGRDDHFI KAKKLLIDTPEQNGVDI GQVGTVES I DTGLVKGLI ERGCI PWAP 

Ml MIMMM Ml II MIMIII MM! II MIMMIMIMMIIM II MMMI 

g 1 3 8 GGHAVGVSGRDDHF I KAKKLLVDTPEQNSVDI GQVGTVES I DTGLVKGLI ERGCI PWAP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 3 8 . pep VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNI AGVMDKTGNLLTKLTPKRIDELI A 

Ml Mi Mill MMMMIMMIIMIMI MIMIIIMMIII! MM Ml! Ml 

gl3 8 VGVG E KGEAFN I NADLVAGKLAE E LNAE KLLMMTN I AGVMD KTGNLLTKLT PKRIDGLIA 

190 200 210 220 230 24 0 



250 260 270 280 290 299 

ml38 . pep DGTLYGGMLPKI ASAVEAAVNGVKATHI IDGRLPNALLLEI FTDAGIGSMI LGGGEDAX 

I M II II I II I M II II IIIIMM M I M I II Ml Ml MUM III M I M III 

gl38 DGTLYGGMLPKI ASAVEAAVNGVKATHI IDGRLPNALLLEI FTDAGIGSMILGRGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 539>: 

al38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 
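This corresponds to the amino acid sequence <SEQ ID 540; ORF 138.a>: 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RDVVLLKLVG IHPVIVHGGG PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPVVAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRVPNALLLE IFTDAGIGSM ILGGGEDA* 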

ml38/al38 99.7% identity in 298 aa overlap 

10 20 30 40 50 60 

- — THPVTVHr^^TK 1 1 M 1 ' 1 1 1 1 1 11 1 M I N I I I j M M | i I I I I M | M ' 



al38 



° 80 90 100 no 120 



ml38.pe P 

^° 140 150 160 170 180 

190 200 210 



a!38 



ml 3 8 .pep 
al38 



200 210 220 

=== 

190 200 2 10 220 230 240 

250 260 270 9 ft ft o or> 

"° 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 541>: 

A £££££ GTTGCAACAA SSSSS SSSSSS" 

"i SSSf™ TGCTCCCGAC SSSSE SS2K g£§X2 

151 AACAGCAGGG CAACGATAGC GGAATCAGCA GCAGTATCTT ACGCCGGTA? 

20! aaaaaacgaa atgtgcaaag acagaagcat gctctgtgcc SSg 
251 acgttgcggt tacagacagg gatgccaaaa tcaaagcccc c?g^?Sg? 
3 01 ataccggaga cttttcaaac ccaaatgacc aatattaaga SSSSSf 

I S? SE™"* GCAATTGAAG CAGGCTATAC AGGACgSS 

401 gtatcgtcga tacaggcgaa tccgtcggca gcatatcctt ?ccS2Sg 
451 tatggcagaa aagaacacgg ctataacgaa aattacaaaa acaaa^S 

501 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTgT^ ACAAATTACA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 139.ng>: 

* MRTTSTFPTK TFTCP A am& t VATTLSACLG GGGGGTSAPD FNAGGTGXGc 

51 NSRATIAESA AVS YAGI KNE MCKDRSMLCA GRDDVAVTOR ££SS£? 

101 IPETFQTQMT NIKNMINLKP AIEAGYTGRG VEVGIVD^E sSSeL 

151 YGRKEHGYNE NYKNKLQKLY GVYAEGSA* * lvm ** SVGSISFPEL 
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 543>: 

m139.seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGACTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGTACCGG TATCGGCAGC 

151 AACAGCAGAG CAACAACAGC GAAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTCCA AACCCAAATG ACGCATtACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTACG AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 544; ORF 139>: 

m139.pep 

1 MRTTPTFPTK TFKPTAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 139 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 139.ng) 
from N. gonorrhoeae: 

m139/g139 



10 20 30 40 50 60 

m139.pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

I I I I II I Ml I I hi M I Mill Ml Mill I I III I M 1 IN I MM Mil I I MM 

g139 MRTTSTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATIAESA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m139.pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

I 1 11 1 1 1 1 1 1 E 1 1 1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 1 ! M 1 1 1 = I ! I 1 1 1 1 1 1 ! I i 1 1 1 : I i = 1 1 1 I 

g139 AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKIKAP-RICIPETFQTQMTNIKNMINLK 

70 80 90 100 110 



130 140 150 160 170 
m139.pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENY EKLYGVYAEGSAX 

I II MM I III M I M M I MM Mill MM M MM 1 1 ih MMIMMMM 

g139 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYKNKLQKLYGVYAEGSAX 

120 130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 545>: 

a139.seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG ACGCAT.ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC. AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

a139.pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 
51 NSRATTAKSA AISYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

»139.p.p MRTTPTFPTKTF^ 



al39 

ml39.pep 
al39 



1 1 11 11 11 1 1 1 1 1 1 : 1 1 1 1 11 I I" I I I I I I I I I I I I I I I I I I I I I I I I II II I II M II I 
^TTPTFPTCTFKPAAMAIAVATTLSA^ 

10 20 3 0 40 50 60 

70 80 90 100 110 ion 

1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 11 I M I I I ) I i M t M M I I M M I I • I I M M i i i i i m 
AISYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIP^ 

70 80 90 100 no 120 

m139.pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX
a139 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYXKLYGVYAEGSAX

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 547>:

Jt **%tnn%Z~ ^ CGGCAAGGG GGCAGgctat c t CAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 
III AACCaR «»C GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

III ™™ GG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGCTC 

301 gagctggatg cctccgaatc atccgcaaca cccgagacgg SaaSg? 

351 ggtcgccgac cgcacagata tgccgggcat ccgcctacgg cgoSSSJ 

401 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCcS 

451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GcogACAGTG CCGCCGCCC^ 

501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

111 ISSSSf GGGTCTGCGC ^CATCGCGC AAACCCAACA gScgSgS 

601 ACGTGGGAAC AGGGCGGTGT CGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 TATCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCcSSSc 

701 TGGGCATAGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAaSc 
III GTCTGTTTGC AGGCATACGG CACGATGTGG GCG^GG 

801 CTATCTCAAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 
III TGCGGATGftA TATGCGGAAG GCAGCGTCAA CGgSSctg 

901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC aSS^Sg 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA SSSSJS 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAACTGTCGC AACcSSS 

All SSSSS 0 GTCCTGTCTG CGACGGCGGG CGTGgSgC £££S£ 

1151 gacgcgacta cgcggtaacg ggcggcttta ccggcgcggc tgcagcaacc 

1201 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TlSSSiS 
TGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCAcSS 

mi ssss ^sr*° tacggcaacc acagcggaca 

This corresponds to the amino acid sequence <SEQ ID 548; ORF 140.ng>: 



51 
101 
151 
201 



1 MSARGKGAGY LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 
SSST™ SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ SSEE 
ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV OHAOTADGVR 



IFNSLAATVY ADSAAAHADM QGR^^ GLDHNGTGI^ SSSS 
TWEOGGVEGK MRGSTQTIGI AAKTGENTTA AATLGI GRST 
251 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL

301 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT
351 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GGFTGAAAAT

401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGQIGV
451 GYRF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 549>:

m140.seq

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC
351 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC
451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA
501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC
551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 
651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 
801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC
851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA
1351 GGCTACCGGT TCTGA

This corresponds to the amino acid sequence <SEQ ID 550; ORF 140>: 

m140.pep

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL
301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 
401 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
451 GYRF*
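
The correspondence between a listed DNA sequence and its amino acid sequence can be checked by translating the DNA in frame from the first base. The following is a minimal sketch only, assuming the standard genetic code; the helper names `clean_listing` and `translate` are illustrative and are not part of the original disclosure. Codons containing an undetermined base (printed as "." in some listings) are rendered as X, and stop codons as *.

```python
# Standard genetic code, with codons enumerated in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_listing(text):
    """Drop position numbers, spaces and line breaks from a printed listing.

    Letters are upper-cased; '.' is kept as a placeholder for an unknown base.
    """
    return "".join(ch for ch in text.upper() if ch.isalpha() or ch == ".")


def translate(dna):
    """Translate from the first base; unrecognised codons become 'X'."""
    dna = clean_listing(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[pos:pos + 3], "X"))
    return "".join(protein)


# First 30 and last 15 bases of the m140 listing (SEQ ID 549).
print(translate("1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT"))  # MSARGKGAGY
print(translate("GGCTACCGGT TCTGA"))                    # GYRF*
```

The two fragments reproduce the first ten residues and the C-terminal end of ORF 140 as listed in SEQ ID 550.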

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

m140/g140

10 20 30 40 50 60 

m140.pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL

I MIMi i 1 1 li I M M 1 1 Mil III II I IN I M I I 1 1 III II I ! I 1 1 U ! I M I 

g140 MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVEKTAGSEGDTP

10 20 30 40 50 60 

70 80 90 100 110 120 
m140.pep
gi40 

ml 4 0 .pep 
gi4 0 



Mill- II Mil 1 1 IIMIIIIIIllMINIIIliiiiiiiiiiiiiTTT 

70 80 9 ° ioo 110 120 

' ' 11 1 1 1 1 : 1 1 1 : 1 1 1 1 1 : II 1 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

RTDMPGI^TTFRTA^^ 

130 140 "0 160 l 7 o 180 

190 200 210 220 

190 200 21 ° 220 230 240 

WSENSANAKTDSISLFAGIIWDVGDIGYLKGLFSYGR^ 

250 260 270 280 290 300 

ml40.pep ^1*^?^**?^^*^ ^^^^l'^' T"^^^?*?^^ ^^^* *P?^ J ^"^^^? KQDA EVAETKGS^^Ij GWS S TfVG Luft-GL 

I' 1 '] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 nil 

320 330 340 350 360 



ml 4 0 . pep 
9140 



gi4o 



ml40 .pep 

^Oplsd^vlsatagverbi^^^ 

370 380 3 9° 400 410 420 



430 440 450

m140.pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX

|||||||||||||||:||||||||||::|||||||

g140 VEFGNGWNGLARYSYTGSKQYGNHSGQIGVGYRFX
430 440 450 
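
The identity figures quoted for each pairwise comparison can be illustrated with a simple column count over two aligned rows. This is a sketch under stated assumptions, not the alignment program used to generate the figures above: the function name `percent_identity` is hypothetical, the rows are assumed to be already aligned and of equal length, and gaps are assumed to be written as "-". Alignment programs may define the overlap length differently.

```python
def percent_identity(row_a, row_b, gap="-"):
    """Percentage of aligned columns in which both rows carry the same residue.

    Columns where both rows hold a gap are ignored; a column with a gap in
    only one row counts towards the overlap length but not as a match.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    columns = [(a, b) for a, b in zip(row_a, row_b)
               if not (a == gap and b == gap)]
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Residues 1-50 of ORF 140 (SEQ ID 550) and ORF 140.ng (SEQ ID 548).
m140 = "MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVE"
g140 = "MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVE"

print(f"{percent_identity(m140, g140):.1f}% identity in {len(m140)} aa overlap")
# 94.0% identity in 50 aa overlap
```

Over these first 50 residues the two sequences differ at three positions (17, 34 and 37), giving 47 matches out of 50.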



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 551>: 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACfTTA rrrr-^r,r-r~ 
51 TGTTCCCTTC CTGAGTGCCG CCAAAAtSg S3SSSJ SSSS 

loi caaacatcga aaccgacggc ggtctgctgg cttccctcga SgcgSgS 
151 aaaacagcgg gtagtgaagg cgacacgctg tcctattatJ 5JSccr^ 
201 caatgcggca cggactgctt cggcagcggc acScSS! cccg?cgg?c 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA cScSIrTr 
301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAgSS SgSJSgC 

SS SEES SSSSS 2SSSS sEE 
££££ = SSSK SSSS I 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AMCCCAaSa GGA?ggSg£ 

ill SS SS ^ a3caM «<*=3S ScSS 

;s iiE ss ssssss ss sssssss 

III SS g£K» 

851 GCAGCACCGG TGCGGACgS SgSS ESSS!"* "^CAGCC 
». ATGCAGCTGG GCGCAcSg SSSSS SSSS 
951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCATCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

1351 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 140.a>: 

a140.pep

1 MSAGGKGAGY LNRTGQRVPF LSAAKIGRDY SFFTNIETDG GLLASLDSVE 

51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 

151 IFNNLAATVY ADSTAAHADM QGRRLKAVSD GLDHNATGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGHST WSENSANAKT 

251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL

301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSIT 

351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

401 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV

451 GYRF*

m140/a140 98.2% identity in 454 aa overlap

10 20 30 40 50 60 

m140.pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL
Ml I I I I I I I I M : I I I I I I I I I I I : I M I I I I I II I M I I ! I I I I I I I I M t I I I I I 
a140 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL

10 20 30 40 50 60 



70 80 90 100 110 120 

m140.pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD
I I i I I I I I I I M i I I I I I It ! I I I II I M M M I II I I I I I I I I I I I I I I ! I II I II I I I 
a140 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD

70 80 90 100 110 120 



130 140 150 160 170 180 

m140.pep RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQGRRLKAVSD

I I I I I I M I I I I I I I 1 I I I I 1 I I I II I I I I I II : I I I I I I i I I I I I I I I I I M I I I M I r 
a140 RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNNLAATVYADSTAAHADMQGRRLKAVSD

130 140 150 160 170 180 

190 200 210 220 230 240 

m140.pep GLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST

I I I M : I M I I I I I I I ! I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I M : I I 
a140 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST

190 200 210 220 230 240 

250 260 270 280 290 300 

m140.pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL
I I I I I I I I M I I M I I I I I I I I I II I I I I I I I I I I I M I | I | | | | | || | | | | | | | | | | M 
a140 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL

250 260 270 280 290 300 



310 320 330 340 350 360 

m140.pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL
I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I i I I I I I : I I I I I M I I ! I 
a140 MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSITEGTLVGLAGL

310 320 330 340 350 360 

370 380 390 400 410 420 

m140.pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD
f I I I I I M I I I i I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I ! I I I I I 
a140 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD
370 380 390 400 410 420

430 440 450

m140.pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX
|||||||||||||||||||||||||||||||||||
a140 VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX

430 440 450



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 553>:

1 atgagcttca aAAc cgATGC CGAAACCGCC CAATCCTCCA CCATGCarrr 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAc" SISgAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC G^cSS 

151 CCGCAAAAAC AAGGCAGGCT GATTTTGGTT ACCGCCAtS aSSISS 

201 ggcgggcgaa ggcaaaacca ccgtaaccat cggtttggS g^cgSSgc 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TOOoSS SSSgS 

301 ccggtgttcg gcgtgaaagg cggcgcggca ggcggcggct aSSSSS 

J" !!^ CCGATG ^ACATCA ACCTGCACTT CACCGgSS JSSSSS 

401 tcggtgcggc gaataacctc ctcgccgcca tgctcgacaa ccaStcS 

501 SS£5i£ AGTTGAACAT CGACCCCAAA CGCGTGCTGT S£5£5£ 

III SSSf RAC ^ CCOCC AGTTGCGCAA CATCATCGAC GGTATGGG^ 

551 AGCCTGTtga cggCGTGATG CGtcccGACG GCTTCGACAT CACCBTCCCr 

601 TCCGAAGTGa tggcgGTATT CTGCCTTGCC AAAGaSSI SSSgS 

651 AGAGCGTTt t gGCAATATTC TCGTCGCCTA CGCCAAAGAC gSSc^ 

70i tttacgccaa agatttgaag gcacacggcg cgatggcggc a?™S£2 

751 GATGCGATTA AGCCCAATTT GGTGCAAACC ATCGaSS SSSSSJ 

801 TGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC aIScCgS 
CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCOGT aISSS 

901 GGCTTCGGCG CGGACTTGGG TGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCAGT CGTCGTGGCG ACtSJSJS 

1001 CCCTGAAATA CAACGGCGGC GTGGAACGCG CCAACcSS SaSaS 

1051 CTCGAAGCCT TGGCAAAAGG TTTGCCCAAC CTGTTGAAAC IStS^ 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC J£SSS£ 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCgS 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG SSSSSJ 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA TGcS^C aSSSa 

™ ^ ACTTCGG GATGTCGAGT TGGGCATCAA aSaSc 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTCG ATTTCA^GC 

lill G ^ GCGTCT GCCSAAA ^G CCTCGCTGGA AAAACTgS 

US ^™ ATCTG CATGGCGAAA ACCCAATATT CATTGAGCGA CAACgSa^ 

llil ZnZ GGGCT GCCCCGAA GG CTTCCGCATC GCCGTACGCG GTAtSSg^ 

1551 ttccgccggc gcgggcttca tcgttgcgtt gtgcggcaat atgatgSaa 

1601 TGCCGGGCCT GCCGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA
1651 CACGGCGTGA TTCACGGCTT GTTCTGA

This corresponds to the amino acid sequence <SEQ ID 554; ORF 141.ng>: 

1 MS FKTDAETA QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTI GLA DALRHIGKDS 

101 PVPGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAaS SJJSSy' 

151 QGNELNIDPK RVLWRRWDM NDRQLRNI ID GMGKPVDGVM JJSSESJI 

201 SEVMAVFCLA KDISDLKERF GUI LVAYAKD GS PVYAKDLK SJwSsSS 

251 DAIKPNLVQT IEGTPAFVHG GPFAN1AHGC NSVTAtSS 

301 GFGADLGAEK FCD I KCRLAG LKPDAAVWA TVRALKYNGG 

III K^f LPN LLKHISNLKN VFGLPVWAL SSSS 2S5SS 
4 01 HGVEVSLTEV WGKGGAGGAD IiARKWNAID NQPNNFGFAY DVELGIKDKT 
451 RAIAQKVYGA EDVDFSAEAS AE I ASLEKLG UhSSSS 

III avrgi ™ ag ^ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 555>:

m141.seq

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTGAGC 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

401 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC

451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 

551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG

1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC
1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC

1401 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA
1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 
1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 
1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 

m141.pep

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNAD NIEPYGHYKA KINPAEAFKL

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY

151 QGNELNIDPK RVLWRRVVDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA

301 GFGADLGAEK FCDIKCRLAG LKPDAAVVVA TVRALKYNGG VERANLGEEN

351 LDALEKGLPN LLKHISNLKN VFGLPVVVAL NRFVSDADAE LAMIEKACAE

401 HGVEVSLTEV WGKGGAGGAD LARKVVNAIE SQTNNFGFAY DVELGIKDKI
451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK
501 LLGCPEDFRI AVRGITVSAG AGFIVALCGN MMKMPGLPKV PAAEKIDVDA
551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141.ng) 
from N. gonorrhoeae: 

m141/g141
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 557>:

1 ATGAGTTTCA AAACCGATGC CGAAATGGCC CAATCCTCCA CCATGCGCCC 

10 SSSSX SSS AGCTGGGTTT GAACGTTGAC SaSgAGC 

t" ^1^. TT ACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA SccSc? 
GGCGGGCGAA GGTAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC
GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 
TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 
CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGCA 
AGCCTGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 
TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 
AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 
CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 
CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 
CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 
TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 
CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 
TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 
CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 
CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 
ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC
CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 
GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 
TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 
CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 
TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 558; ORF 141.a>: 

a141.pep

1 MSFKTDAEIA 

51 PQKQGRLILV 

101 PVFGVKGGAA 

151 QGNELNIDPK 

201 SEVMAVFCLA 

251 DAIKPNLVQT 

301 GFGADLGAEK 

351 LDALEKGLPN 

4 01 HGVEVSLTEV 

4 51 RAIAQKVYGA 

501 LLGCPEDFRI_ 

551 EGVIHGLF* 

m141/a141 99.5% identity in 558 aa overlap
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 559>:

ci l^^t ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG SSUtoS? 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG 

201 TTTCCGACAG GATcggaATG AttCgCGCAC TCCGGTTTAT gSSSSc 

251 acggtcggcg gctcgtcggt aaccggcgca accgccgtca t£££S£ 

ll\ S^ CCT GCCGCACCGT CTGTCGTGAT GACATGAACG SSSS 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG

451 GCATCCCCCC ATGTGCAGCA GTTCTGA

This corresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>:

1 MRADFMFADN MPVQVRQRAF YFKLSRFAAM PNMVGKPLFG RQAGOPGKMF 

51 GNILMFVRQH IDAEAAVFRQ DRNDSRTPVY AQHHGRRLVG nSSSSI 

«1 L T S|f DMNACRTGCH RITERSLKSF L QXR„FSpS £££££ 
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 561>:

m142.seq

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCG ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCGC
351 AAGATGCCAT CGCATCACGG AACGAAGTTT GAAAATTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG
451 GCATCCCCcC ATGTGCAGCA GTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

m142.pep

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDVVGKPLFG RQAGQPGKMF
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 
m142/g142
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 563>:

a142.seq

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCCGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

601 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

701 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

751 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 
This corresponds to the amino acid sequence <SEQ ID 564; ORF 142.a>:

Si SSS^HSSSSS ESSE SSSES 

m142/a142 96.1% identity in 153 aa overlap
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 565>:
1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT

1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

g143.pep

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGWFFIL PPLAGMLVQP

51 IVGYYSDRTW KPRLGGRRLP YLLYGTLIAV IVMILMPNSG SFGFGYASLA

101 ALSFGALMIA LLDVSSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD

151 AVVAAILPFV FAYIGLANTA EKGVVPQTVV VAFYVGAALL IITSAFTISK

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYGVLAAV *SVAAVICSF

301 ILAKVPNKYH KAGYFGCLAL GALGFFSIFF IYNQYALILS YILIGIAWAG

351 IITYPLTIVA NALSGKHMDT YLGLFNGSVC MPQIVASLLS FVLFPMLGGH

401 QATMFLVAGA VLLLGAFSVC LIKEIHGGV*



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

m143.seq

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

401 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC

751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

m143.pep

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGWFFIL PPLAGMLVQP

51 IVGHYSDRTW KPRLGGRRLP YLLYGTLIAV IVMILMPNSG SFGFGYASLA

101 ALSFGALMIA LLDVSSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG

151 AVVAAILPFV FAYIGLANTA EKGVVPQTVV VAFYVGAALL VITSAFTIFK

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYGVLAAV QSVAAVICSF

301 VLAKVPNKYH KAGYFGCLAL GALGFFSVFF IGNQYALVLS YTLIGIAWAG

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQIVASLLS FVLFPMLGGL

401 QATMFLVGGV VLLLGAFSVF LIKETHGGV*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m143/g143 93.9% identity in 429 aa overlap
a143.seq

1 ATGCTCAGTT TCGGCTTTCT
51 CTCGCAGATG AGCCGCATCT
101 TCGGCTGGTT CTTTATCCTG
151 ATTGTCGGCC ATTACTCCGA
201 CCGTCTGCCG TATCTGCTTT
251 TTTTGATGCC GAACTCGGGC
301 GCTTTGTCGT TCGGCGCGCT
351 TATGGCGATG CAGCCGTTTA
401 AGCAGAAAGG CTACGCCTAC
451 GCGGTCGTGG CGGCGATTCT
501 GAACACCGCC GAGAAAGGCG
551 ATGTGGGTGC GGCGTTGCTG
601 GTGAAGGAAT ACAATCCGGA
651 CGCCGCGAAT CAGGAAAAAG
701 CTAAGGCGTT TTGGACGGTT
751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA

1101 TATGGGCACT TACTTGGGCC TGTTTAACGG CTCTATCTGT ATGCCGCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

a143.pep

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLGWFFIL PFLAGMLVQP

51 IVGHYSDRTW KPRLGGRRLP YLLYGTLIAV IVMILMPNSG SFGFGYASLA

101 ALSFGALMIA LLDVSSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG

151 AVVAAILPFV FAYIGLANTA EKGVVPQTVV VAFYVGAALL VITSAFTIFK

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYGVLAAV QSVAAVICSF

301 VLAKVPNKYH KAGYFGCLAL GALGFFSVFF IGNQYALVLS YTLIGIAWAG

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQIVASLLS FVLFPMLGGL

401 QATMFLVGGV VLLLGAFSVF LIKETHGGV*



m143/a143 99.5% identity in 429 aa overlap
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 571>:

g144.seq

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 
51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGTGC 
101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 
151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 
201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 
251 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 
301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 
351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 
401 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGTtaccT ATCGCGCCAC
451 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG ATGCCGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CGAAGTATTT GA 

This corresponds to the amino acid sequence <SEQ ID 572; ORF 144.ng>: 

g144.pep

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTC VFVLDLGGIV QEFSVLADGV

51 RENPVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 573>: 

m144.seq

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA

451 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 574; ORF 144>: 

m144.pep

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLADGV

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE

101 GRNALHGGSH GLAVTRFNAV AADGRSVVLR SRLATVGRRL SQRFGFGYFL

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA

201 GRCRKTARLN GFRRPRSI*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

m144/g144 91.3% identity in 218 aa overlap
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 575>:

a144.seq

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG. 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA

451 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 

a144.pep

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLADGV

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE

101 GRNALHGGSH GLAVTRFNAV AADGRSVVLR SRLXTVGRRL SQRFGFGYFL

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS

201 GRCRKTARLN GFRRPRSI*

m!44/al44 99.1% identity in 218 aa overlap 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 577>:

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 
51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT SaSc^ 
101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA gSSgCGGC 

151 aagcacgtcg aaagaaggcg gcaggataaa gataccgaca gcSccggca 
201 gcgcgttgcg aacctgcgcc gcgccctgaa cgtcgatttc cSa^JScg 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC SSSSSS 

111 ^™^ CG AATACGTCT G CGTATTCCAA AAAAGCCTCC TOCgSSS 
^™ CAAA CTCTTCTT TG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGAcISS 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCESS? 

601 ATATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA ACCCTATTTC 

This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>:

g146.pep

1 MKQIPLRLLQ VVIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG

SSSKSJ S??^ VA NLRRALNVDF ONHVIACRRQ R^S 
I^^FQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA R RMRHGNAQT 

201 JSSSSS Hl^ 30 * nd >™afsis ghxfylyifq pivsqrtp?f 



101 



15l SiS KSES i™* ??™ Sag? 

201 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 579>:

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 



AGTCAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT SSSSSJ 

ill SS£E£ CTTCCCGA ^ GTCCGTCCCG CGTCCGTTgI gSSgcggC 

201 AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC Sg^Sa 

351 rrrfZ^? AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGcSJS 

n 5 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

JS r™Zl CA ° GCGGCGGGCG CGTCGTGTGC GACACGGAAA CgSSSXS 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGAoSS5 

Si ^™J GTA TATTTTCCAG CCGATTGTAT CACAATGGAC aSSgSc 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA

This corresponds to the amino acid sequence <SEQ ID 580; ORF 146>:

m146.pep

1 MAQILLRSRQ VVIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VRPASVEARG

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ RIHTLRACAV

101 IVAKYVGVFQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF

201 LFADAHILPL LF*
Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

m146 / g146 90.1% identity in 212 aa overlap
GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX
GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 581>:

a146.seq

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

a146.pep

1 MAQILLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF

201 LFADAHILPL LF*

m146/a146 90.6% identity in 212 aa overlap

10 20 30 40 50 60 

m146.pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK
I M I I I I I I I : I I I I I :: I I II : I I I I I I I I I I I I I I I I I 1 I I I I I : I : I :: M I I I M 
a146 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK

10 20 30 40 50 60 

70 80 90 100 110 120

m146.pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK
M M ! II I : : I I I M I 1 1 I M [ M : I I 1 ! I I I ] | | | | | | | | | : : | i I | || : I I ! I ! I I 
a146 DADGFGQRISNLSRALNVDFQNHVITCRRQRIHTLRACAVIVAEHVRVFQKSLLRDKRLK
70 80 90 100 110 120



130 140 150 160 170 180 

m146.pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS

1 M I I I I I I I [ I M i M I I : M f I ] I I |j M I I I 1 I I I I I I | M | | | f | | j [| I I M I ! 1 
a146 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS
130 140 150 160 170 180



190 200 210 

m146.pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX

I I I I I I I I I I I I I M I I : I M I I I M I | M M 
a146 GHIFYLYIFQPIVSQRTPGFLFADAHILPLLFX

190 200 210 



g147.seq (partial)

1 ..ATGCGACGAG

51 ATCAATTCTT

101 AGCAATCGGT

151 CGCGCGACTT

201 CTCCGGCGAT

251 ACGGCGTACC

301 CCCGTTATTC

351 CGGCGAAACG

401 TAGATACCGC

451 CTCTTGTACA

501 gAAAAAtccc



This corresponds to the amino acid sequence <SEQ ID 584; ORF 147.ng>: 

g147.pep (partial)

1 ..MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSVVGKSRP

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA..

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 585>: 

m147.seq (partial)

1 ..CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

201 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTTGAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

401 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG

451 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

701 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

751 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATTTACAGCT

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGt GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 GCAACGACTA CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
1101 AGGTCGTCTG AAAGGCAGCT GGGGCGTGCA ATATTTACAA CAAAAATCCA

1151 GTGCTTTATC TGCCATATCC GAAGCGGTTA AACAACCGAT GCTGCTTGAC

1201 AACAAAGTGC AACATTACAG CTTTTTCGGT GTAGAACAGG CAAACTGGGA

1251 CAACTTCACG CTTGAAGGAG GCGTACGCGT GGAAAAACAA AAAGCCTCCA

1301 TTCAGTACGA CAAAGCATTG ATTGATCGGG AAAACTACTA CAACCACCCC

1351 CTGCCCGACC TCGGCGCGCA CCGCCAAACC GCCCGCTCAT TCGCACTTTC

1401 GGGCAACTGG TATTTCACGC CACAACACAA ACTCAGCCTG ACCGCCTCCC

1451 ATCAGGAACG CCTGCCGTCA ACGCAAGAGC TGTACGCACA CGGCAAACAC

1501 GTCGCCACCA ACACCTTTGA AGTCGGCAAC AAACACCTCA ACAAAGAGCG

1551 TTCCAACAAT ATCGAACTCG CGCTGGGCTA CGAAGGCGAC CGCTGGCAAT

1601 ACAATCTGGC ACTCTACCGC AACCGCTTCG GTAACTACAT TTACGCCCAA

1651 ACCTTAAACG ACGGACGCGG CCCCAAATCC ATCGAAGACG ACAGCGAAAT

1701 GAAGCTCGTG CGCTACAACC AATCCGGCGC CGACTTCTAC GGCGCGGAAG

1751 GCGAAATCTA CTTCAAACCG ACACCGCGCT ACCGCATCGG CGTTTCCGGC

1801 GACTATGTAC GAGGCCGTCT GAAAAACCTG CCTTCCCTAC CCGGCAGAGA

1851 AGATGCCTAC GGCAACCGTC CTTTCATCGC ACAGGACGAC CAAAATGCCC

1901 CCCGTGTTCC GGCTGCGCGC CTCGGCTTCC ACCTGAAAGC CTCGCTGACC

1951 GACCGTATCG ATGCCAATTT GGACTACTAC CGCGTGTTCG CCCAAAACAA

2001 ACTCGCCCGC TACGAAACGC GCACGCCCGG ACACCATATG CTCAACCTCG

2051 GCGCAAACTA CCGCCGCAAT ACGCGCTATG GCGAGTGGAA TTGGTACGTC

2101 AAAGCCGACA ACCTGCTCAA CCAATCCGTT TACGCCCACA GCAGCTTTCT

2151 CTCTGATACG CCGCAAATGG GCCGCAGCTT TACCGGCGGC GTGAACGTGA

2201 AGTTTTAA



This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 

m147.pep (partial)

1 ..PHKTEQSVDL ETVSVVGKSR PRATSGLLHT STASDKIISG DTLRQKAVNL

51 GDALDGVPGI HASQYGGGAS APVIRGQTGR RIKVLNHHGE TGDMADFSPD

101 HAIMVDTALS QQVEILRGPV TLLYSSGNVA GLVDVADGKI PEKMPENGVS 

151 GELGLRLSSG NLEKLTSGGI NIGLGKNFVL HTEGLYRKSG DYAVPRYRNL 

201 KRLPDSHADS QTGSIGLSWV GEKGFIGVAY SDRRDQYGLP AHSHEYDDCH 

251 ADIIWQKSLI NKRYLQLYPH LLTEEDIDYD NPGLSCGFHD DDNAHAHTHS 

301 GRPWIDLRNK RYELRAEWKQ PFPGFEALRV HLNRNDYRHD EKAGDAVENF

351 FNNQTQNARI ELRHQPIGRL KGSWGVQYLQ QKSSALSAIS EAVKQPMLLD 

401 NKVQHYSFFG VEQANWDNFT LEGGVRVEKQ KASIQYDKAL IDRENYYNHP

451 LPDLGAHRQT ARSFALSGNW YFTPQHKLSL TASHQERLPS TQELYAHGKH

501 VATNTFEVGN KHLNKERSNN IELALGYEGD RWQYNLALYR NRFGNYIYAQ 

551 TLNDGRGPKS IEDDSEMKLV RYNQSGADFY GAEGEIYFKP TPRYRIGVSG 

601 DYVRGRLKNL PSLPGREDAY GNRPFIAQDD QNAPRVPAAR LGFHLKASLT 

651 DRIDANLDYY RVFAQNKLAR YETRTPGHHM LNLGANYRRN TRYGEWNWYV 

701 KADNLLNQSV YAHSSFLSDT PQMGRSFTGG VNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m147 / g147 92.3% identity in 142 aa overlap

10 20 30 

m147.pep PHKTEQSVDLETVSVVGKSRPRATSGLLHTS

I = I I I I I I I I I I M i I! I I I I I I I | | | | | 
g147 MRREAKMAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSVVGKSRPRATSGLLHTS

10 20 30 40 50 60 

40 50 60 70 80 90 

m147.pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET

1 1 1 1 1 ' i ii m 1 1 1 1 1 n 1 1 n i n 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 n n i j 1 1 1 1 1 1 1 1 1 1 1 1 

g147 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET

70 80 90 100 110 120 

100 110 120 130 140 150

m147.pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG

' t I I I I I 1 I I I I M M I M I I II I I I I | M I I I I I I I II I : I | | 

g147 GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGAGQCCRWKNPPKNA
The following partial DNA sequence was identified in N. meningitidis <SEQ ID 587>:

a147.seq

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT
51 ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG
101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG
151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG ACAAAATCAT
201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG
251 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT
301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA
351 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT GCAATCATGG
401 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG
451 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG
501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT
551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT
601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA
651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG
701 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC
751 GAAAAAGGCT TTATCGGCGC AGCATACAGC GACCGTCGCG ACCAATATGG
801 TCTGCCTGCC CACAGCCACG AATACGATGA TTGCCACGCC GACATCATCT
851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG
901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT
951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA
1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG
1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACGACTACCG
1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA
1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA
1201 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC
1251 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC
1301 ATTACAGCTT TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT
1351 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA
1401 AGCATTGATT GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG
1451 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT
1501 TTCACGCCAC AACACAAACT CAGCCTGACC GCCTCCCATC AGGAACGCCT
1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA
1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC CAACAATATC
1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT
1701 CTACCGCAAC CGCTTCGGCA ACTACATTTA CGCCCAAACC TTAAACGACG
1751 GACGCGGCCC CAAATCCATC GAAGACGACA GCGAAATGAA GCTCGTGCGC
1801 TACAACCAAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT
1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG
1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC
1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC
2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG
2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC
2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG
2151 CCGCAATACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC
2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG
2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA



This corresponds to the amino acid sequence <SEQ ID 588; ORF 147.a>:

a147.pep

1 MRREAKMAQT TLKPIVLSIL LINTPLLSQA HGTEQSVGLE TVSVVGKSRP

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWIDLRNKR YELRAEWKQP

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE LRHQPIGRLK

401 GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL

451 EGGVRVEKQK ASIRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR

601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY

701 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSSFLSDTP

751 QMGRSFTGGV NVKF*

m147/a147 98.1% identity in 734 aa overlap

10 20 30 

m147.pep PHKTEQSVDLETVSVVGKSRPRATSGLLHTS

I I I I I I I I I t I! I I I I I I I I I I I I I I i I 
a147 MRREAKMAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSVVGKSRPRATSGLLHTS

10 20 30 40 50 60 

40 50 60 70 80 90

m147.pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET
M I I I II I I I I I I I I M I II I I I I i I I I I I I M I I I I I I I I I M I I I I I I I I I I i M I I I 
a147 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET

70 80 90 100 110 120 

100 110 120 130 140 150 

m147.pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG
I I I I I I M I I I I M I : I I i I I I I I I I I I I I I I I I I I II I I I M I I I I II 11 I I I I I I I I I 
a147 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG

130 140 150 160 170 180 

160 170 180 190 200 210 

m147.pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ
I I I I I I M I I I I I I M I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I 
a147 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ

190 200 210 220 230 240 

220 230 240 250 260 270 

m147.pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL
I I I I I M I I I M I I I I : I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I M I I I I I I I I 
a147 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL

250 260 270 280 290 300 

280 290 300 310 320 330 

m147.pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH
I M I I I I I I I II I I I II I I I I : I I I I : I : I : I I I I II I I I I M I I I I I I I I I II II I I I I 
a147 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH

310 320 330 340 350 360 

340 350 360 370 380 390 

m147.pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE
M J II I I I I M I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I It 
a147 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE

370 380 390 400 410 420 

400 410 420 430 440 450 

m147.pep AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPL

M I I M I I I I I I I I I M I I I I I II I I I I I I I I I I I M I I I I I I : I I I I I I I I I I I I I I I I 
a147 AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPL

430 440 450 460 470 480 

460 470 480 490 500 510 

m147.pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK
I I ! I I M II I I I I I I I I I I I I I I I I I I I I I I || I I | | || | | | | | | | | | | | | | | | | | | | | j 
a147 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK

490 500 510 520 530 540 

520 530 540 550 560 570 

m147.pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR

I I I I I I I I I I I I I I I I I M M I I ! I I I I I I I | I | | | | | | | | | 1 | | | | M I I I I 

a147 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR
550 560 570 580 590 600



580 590 600 610 620 630 

m147.pep YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ
l,,tMI| fMIM!llllljtltllMMilllllll||||M|M!llilt:lll II 
a147 YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ
610 620 630 640 650 660



640 650 660 670 680 690 

m147.pep NAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT

1 11 1 1 1 1 1 1 1 ' 1 1 > I I I i I I I I I I M j j | I | | |f 1 I ! I I I E j | | | j | j | f | | | | j | i | I 
a147 NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT
670 680 690 700 710 720



700 710 720 730 

m147.pep RYGEWNWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX

1 N I i M I I I I I I I I I I | | | | | M | ! | | | j | | | || | || , , | , , , , 
a147 RYGEWNWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX
730 740 750 760



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 589>:

g148.seq

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

401 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC

451 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

g148.pep

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS

51 AEYFRLLVDL LVYRYMDQKI DIVAGLDARG FIIGAALAYQ LNVGFVPIRK

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK PGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 591>:

m148.seq

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

401 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC

451 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

m148.pep

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS
51 AEYFRLLVDL LVYRYMDQKI DIVAGLDARG FIIGAALAYQ LNVGFVPIRK
101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m148 / g148 99.0% identity in 199 aa overlap

10 20 30 40 50 60 

m148.pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL
I I I I I I I I I I I I I I I I I M I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I 
g148 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL

10 20 30 40 50 60 

70 80 90 100 110 120 

m148.pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA
I I I I I I I I I I ! I I 1! I I I I I I I t I I I I I I I I I I I I I M I I I I I M I I M I I I I I I ! I I I I 
g148 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA

70 80 90 100 110 120 

130 140 150 160 170 180 

m148.pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI

I I I I I 1 t I I I I I I I I I M I ! : I I I I I I I I I I I I I I I I I | U | | | | M | | | | M | | | ! | | 
g148 AVEIHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI

130 140 150 160 170 180 

190 200 
m148.pep RASGAPLFTLLQNEGCMKGX
I I 1 I I I I I I I I I I I I M I I I 
g148 RASGAPLFTLLQNEGCMKGX

190 200 
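
The identity figures reported for each pair of sequences can be checked by counting matching positions in two aligned sequences. The following is a minimal sketch, assuming gap-free sequences of equal length; the function name `percent_identity` is illustrative only, and the alignment program actually used for the analysis is not named in this section.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Return the percentage of positions at which two aligned sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences carry the same residue.
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Residues 121-150 of m148 and g148 as listed above; they differ at two positions.
m148_part = "AVEIHTDAVKLGSRVLLVDDLIATGGTMLA"
g148_part = "AVEIHTDAVKPGSRVLLVDDLVATGGTMLA"
print(round(percent_identity(m148_part, g148_part), 1))
```

Over the full 199 aligned positions (counting the terminal stop symbol), the same two mismatches give 197/199, or about 99.0%, which agrees with the figure stated for m148 / g148.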

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 593>:

a148.seq

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

401 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC

451 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>: 

a148.pep

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS

51 AEYFRLLVDL LVYRYMDQKI DIVAGLDARG FIIGAALAYQ LNVGFVPIRK

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 
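
Each peptide listing is the conceptual translation of the DNA listing that precedes it. The following is a minimal sketch of that translation, assuming the standard genetic code, reading frame 1, and a sequence containing only A, C, G and T; the names `CODON_TABLE` and `translate` are illustrative and do not come from the original analysis.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First row (positions 1-50) of a148.seq as listed above.
first_row = "ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA"
print(translate(first_row))
```

The sixteen complete codons in this row translate to MALKTSNLEHAMLVHP, which matches the start of a148.pep.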

m148/a148 99.5% identity in 199 aa overlap

10 20 30 40 50 60 

m148.pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL

I I M I I I M I I I I I I I I I I I 1 II I I I I I I I I I I I M I I I I I I I M I I I I I I I 1 I I I I I I I 
a148 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL

10 20 30 40 50 60 

70 80 90 100 110 120 






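m148.pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA
a148 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA

70 80 90 100 110 120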

190 200 
m148.pep RASGAPLFTLLQNEGCMKGX
N I I I I I I I | j | ! f | | j | j | 
a148 RASGAPLFTLLQNEGCMKGX
190 200



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 595>:

^ ™r GATT ° ACAACA ATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 
ill GACAAC "CA CGCTTGAAGG CGGCGTACGC GTgSSaS 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AGAAAACTAC 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCG^SSc 

20! GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAC^ aSSSgCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCaagA a^SSgca 

5° ssss ca G t C c r\ c ^ gtcg9 L .ssss 

J5i CAACAAAGaG CgttccaacA atatcgaACT CGCGCTGGgc tAcaaaaaca 

til ATtSgS JESSS™ 0CMCCB «* GCAACCGAtT 

451 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 

501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 

551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATr 

5° S£S£ ST'S aC9a " C ^ T CTGAAAA^CC SSSSSS 

651 ACCCGGCAGG gaagatccCT AcggcAAACG TCccttcaTC GCACAAGCCG 

75° SSST CC "« GCGC SCCTCGGcS S2£££ 

/^l ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTrTT 

III ESS**" ™^ CGCCC GCTACGAAAC GCGTACGCCC JSSSS 

90^ ™^ CT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAcSS 

III ^ GGTACG TCAAAGC( =<3A CAACCTGCtc aACcaatCcg tTTACGCCCa 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCq 

1001 gcgtaAACGT GaAGTTttaA AJ-cgccgiscg 

This corresponds to the amino acid sequence <SEQ ID 596; ORF 149.ng>:

1 MLIDNNVRHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY

51 YNQPLPDLGA HRQTARSFAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARLGFHLK

251 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FAGGVNVKF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 597>:



1 ATGCTGCTTG ACAACAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC

451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 598; ORF 149>: 

m149.pep

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng)
from N. gonorrhoeae:

m149/g149

10 20 30 40 50 60 

m149.pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA

MM ' -MM MM I M M I M MMMMI M Ml Ml II M I hi 1 1 I MMI MM 

g149 MLIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNQPLPDLGA
10 20 30 40 50 60 

70 80 90 100 110 120 

m149.pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE

I M I I II I I I I I I I I II I : 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g149 HRQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE
70 80 90 100 110 120 

130 140 150 160 170 180 

m149.pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG

I M MM III'MII MIMI M IMMMMMM I Ml I M Ml I M M M II M II 

g149 RSNNIELALGYKGDRWQYNLAAYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG
130 140 150 160 170 180 

190 200 210 220 230 240 

m149.pep ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV

M M I II M II II II M I I I M I I I I MM IMMM II I M IMMIMI M!MM 

g149 ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRI
190 200 210 220 230 240 

250 260 270 280 290 300 

m149.pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW

I i I 1 1 1 M MMI II M i I I i II I M MIMMMII II 1 1 M M M I II IM Ml I I M 

g149 PAARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW
250 260 270 280 290 300 

310 320 330 340 

m149.pep NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX

IIIIMIIMIMMIIIII IMIIIIIIIhllMIIII 






g149 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFAGGVNVKFX

310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 599>:

1 ATGCTGCTTG ACAATAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT TGATTGATCG GGAAAACTAC

151 TACAACCATC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC

451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTCT

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG

701 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA

751 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG

1001 GCGTGAACGT GAAGTTTTAA

This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>:

a149.pep

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPLI AQADQNAPRV PAARLGVHLK

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF*

m149/a149 98.8% identity in 339 aa overlap

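10 20 30 40 50 60

m149.pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA
a149 MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGA

10 20 30 40 50 60

70 80 90 100 110 120

m149.pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE
a149 HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE

70 80 90 100 110 120

130 140 150 160 170 180

m149.pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG
a149 RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG

130 140 150 160 170 180

190 200 210 220 230 240

m149.pep ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV
a149 ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRV

190 200 210 220 230 240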

250 260 270 280 290 300

m149.pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW






I i N I I I I I I I I I I I I I M I I I I M I I I I I I j I I I I I I I I M I I | | I | | | | | | | | | | | | 
a149 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW

250 260 270 280 290 300 

310 320 330 340 

m149.pep    NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX
            ||||||||||||||||||||||||||||||||||||||||
a149        NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX

310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 601>:

g149-1.seq

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

1401 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-1.ng>:

g149-1.pep

1 MAQITLKPIV LSILLINTPL LAQAHETEQS VGLETVSVVG KSRPRATSGL

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDRY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 
351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ

401 YLGQKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV

451 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARSFALS GNWYFTPHHK

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY

551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFIA 

651 QADQNAPRIP AARLGFHLKT SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF

751 TGGVNVKF*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 603>:

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 
51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 
1?, CGTCGTCGG C AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 
201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT AGACGGCGTA CCGGGcItCC 
251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTTAA 
301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT "cGgS CAGGcS 
351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 
401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 
451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGA^A^T 
501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 
551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCaSc 
601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACG^T 
651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 
701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GC S SaTCGGC 
751 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 
801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 
851 ACAAACGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 
901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 
951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG 
1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 
1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 
1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCctcATCG 
1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 
1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 
1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA AC S TTTTTCGGTG 
1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 
1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 
1401 AAACTACTAC AACCACCCCC TGCCCGACCT CGGCGCgSc J " 
1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 
1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 
1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTOGGcS 
lis, CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 
1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CcScCA 
1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGrrrr 
1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 
1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG aS^CTGC 
1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 
1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 
2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTAC^ACC 
2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 
2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 
2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2251 A r^™ AG TCTGATACGC CGCAAATGGG SSSS 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1>:

m149-1.pep

1 MAQTTLKPIV LSILLINTPL LAQAHETEQS VDLETVSVVG KSRPRATSGL
51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ
101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG
151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN
201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG
251 VAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI
301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA
351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ
401 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV
451 EKQKASIQYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK
501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY
551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 
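
The correspondence between a listed DNA sequence and its amino acid sequence can be checked by translating the open reading frame codon by codon. The following is a minimal sketch in Python, assuming the standard genetic code and a reading frame that begins at position 1 of the listed sequence. The function name `translate` and the use of `X` for unreadable codons are illustrative choices and are not part of the original disclosure.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the listed sequences are read with the standard code, frame 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop, 'X' an unreadable codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 48 bases of SEQ ID 603, and its final line (position 2251 onward).
print(translate("ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATC"))
print(translate("ACCGGCGGCG TGAACGTGAA GTTTTAA"))
```

For these two fragments the sketch gives MAQTTLKPIVLSILLI and TGGVNVKF*, which agree with the first and last residues listed for SEQ ID 604.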

m149-1/g149-1 96.2% identity in 758 aa overlap

10 20 30 40 50 60 

ml49-l .pep MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 

ii' 1 1 1 1 ) 1 1 1 i 1 1 1 i I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gl49-l MAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSWGKSRPRATSGLLHTSTASDKI 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9-1 . pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
t I II II I I I I I I I I I I I I I I I { I I I I I | I I I | | | | | | | ] | | | | | | | | | | | | | | | | | | | | | 
g 1 4 9 - 1 I SGDTLRQKAVNLGDALDGVPGI HASQYGGGASAPV I RGQTGRRI KVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9-1 . pep S PDHAIMVDTALSQQVEI LRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 

I I I I I I M I I I j I I M I I I I I I I I I I M I I I I I I I I I I I I IE I M I I I I 

gl49-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml49-l.pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I 1 I I I I I I : I I t I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I 1 I I I I I I 
gl49-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l . pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I I I I i I I I I I : M I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I f I i I I I I 
gl4 9-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml49-l . pep DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I I I I I M I I I I I : I M I 1 I : I : I I I I I I I | { I I I f I I I I I I E I I I I I I I I I II I I I I 
gl4 9-l DYDNPGLSCGFHDGDGAHAHTHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 9-l .pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
: I I I M M I I I I I I I I : I : I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I : I : I I I 
gl49-l HHDEKAGDAVENFFNNKTHNARIELRHQPIGRLKGSWGVQYLGQKSSALSAIPETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 4 9-1 . pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
I : 1 I : I :! I I I I J I I I I I I I I I I I i I I M I I I I I I M : I I I I I I i ! I I I i I : | I j I | | | | 
gl4 9-1 LI DNNVRHYS FFGVEQANWDNFTLEGGVRVEKQKAS I RYDKALI DRENYYNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 4 9-1 . pep RQTARS FALSGNW YFT PQHKLSLTASHQERLPSTQELYAHGKH VATNT FEVGNKHLNKER 

I I I U I I I I I I I I I I I I : I I I I I 1 I I I I I E I I I I I I I I E I I I II I 1 I I i I I ! I t I 

gl4 9-l RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml4 9-1 . pep S NN I E LALG YEG D RWQ YNL AL Y RN RFGN Y I YAQTLNDGRG PKSIEDDS EMKL VR YNQS G A 
I I I I I I II I I I t I I I I I I I I IE I I E I I I I I I I | | | I | | | | | | | | | | | | | | | | | | | | | | | 
gl49-l SNNI ELALGYEG DRWQYNL AA YRNRFGN Y I YAQTLNDGRGPKS IEDDS EMKLVR YNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml4 9-l .pep D F YG AEG EIYFKPTPRYRIGVSG D Y VRGRLKN L P S L PGRE DA YGN RP F I AQ D D QN AP RV P 
I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I 1 M | J | | | | I I : I I I I I 1 111111:1 
gl4 9-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 620 630 640 650 660 
                   670       680       690       700       710       720
m149-1.pep  AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN
            |||||||||:||||||||||||||||||||||||||||||||||||||||||||||||||
g149-1      AARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN
                   670       680       690       700       710       720

                   730       740       750      759
m149-1.pep  WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX
            |||||||||||||||||||||||||||||||||||||||
g149-1      WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX
                   730       740       750
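
The identity figures quoted for the alignments (for example, 96.2% identity in 758 aa overlap) can be illustrated by counting identical positions between two aligned rows. The sketch below is written in Python and assumes that percent identity means identical positions divided by the length of the overlap, with gap characters counted in the length; the alignment program used for the original analysis may define it differently. The function name `percent_identity` is hypothetical.

```python
def percent_identity(top: str, bottom: str, gap: str = "-"):
    """Return (identical, overlap_length, percent) for two aligned rows."""
    if len(top) != len(bottom):
        raise ValueError("aligned rows must have equal length")
    if not top:
        raise ValueError("aligned rows must not be empty")
    # A column counts as identical only if both residues match and are not gaps.
    identical = sum(1 for a, b in zip(top, bottom) if a == b and a != gap)
    return identical, len(top), 100.0 * identical / len(top)


# Residues 181-240 of the m149-1/g149-1 alignment (one G/A substitution).
m_row = "SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL"
g_row = "SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL"
identical, overlap, pct = percent_identity(m_row, g_row)
print(f"{identical}/{overlap} identical = {pct:.1f}%")
```

Applied block by block and summed over the whole overlap, the same count yields the overall figure reported for a pair of sequences.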



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 605>: 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA
51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG
101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCTCCAC TTCGGGGCTG
151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACMctS
201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACgIcGTA C^GGGCMTC
251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA
301 GGATTAAAGT GTTGAACCAT CACGGCGAAA CGGGcS
351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTCTCGT
401 AACAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC
451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGaISt
501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG ATCaI^I
551 ^JS GAAAA ACTCAC GTCC GGCGGCATCA ATATCGGTTT SS
601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT
651 a*™™ CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC
701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC
751 ^ ATACA GCGACCG TCG CGACCAATAT GGTCTGCCTG CCcIcSa
801 ^r^ SAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA
851 A ^ AAACGCTA TTTGCA GCTT TATCCGCACC TGTTGACCGA AGAAGACATC
901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC
951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG SS
1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC
1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG aIaIa^AGG
1101 TTAACAACCA AACGCAAAAC StATCG
1151 CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA
1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA
1251 ACAACCGATG CTGCTTGACA ATAAAGTGCA ACATTACAGC TTTTTCGGTG
1301 G^™™ V™™*™ AACTTCACGC TTGAAGGCGG CGTACGCGTG
1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA
1401 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC ScaSS
1451 ^ G " CATT CGCACT ™ GGCAACTGGT ATTTCACGCC
1501 ™ AG i CTGA CCGCCTC CCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT
1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA
1601 JSS^?** CAAAGflGCGT TCCAACAATA TCGAACTCGC GCTCGGC^AC
1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACC6CTTCG?
1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCcScCA
1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGTTrrr
1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA C^CGCGCTA
1851 CT^CCTACC CGGcS IS™""* ScTGC
1901 OTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCP
1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCG^CCA
2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GAC^AC^ACC
2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CA^GCCCGGA
2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG
2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CMTCCG^
2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT
2251 ACCGGCGGCG TGAACGTGAA GTTTTAA



This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-1.a>:

a149-1.pep

1 MAQTTLKPIV LSILLINTPL LSQAHGTEQS VGLETVSVVG KSRPRATSGL

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG

251 AAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ

401 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

a149-1/m149-1 98.0% identity in 758 aa overlap

10 20 30 40 50 60 

al4 9-l.pep MAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSWGKSRPRATSGLLHTSTASDKI 

I I ! I I I I I I I I I I 1 I I I I I I I : I I 1 II I II I I ! I I I I I I I I I I I | | | | | | | H 

ml4 9-l MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 

10 20 . 30 40 50 60 

70 80 90 100 110 120 

al4 9-l.pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

I ! I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I M I I I I I I I I I I I I I I I I I t II I I I ! I 
ml 4 9-1 ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
70 80 90 100 110 120 

130 140 150 160 170 180 

al4 9-l.pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I I I I I I I 1 I I I I I I I I I I I i I I I !] I M | | | I 1 I I I I I I I I I I I i | | | | | | | M I M I 
ml49-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

al49-l.pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRKLKRLPDSHADSQTGSIGL 
I I M I I I M ! I I I I I I I I I I I I i I i I I I I I | | I | | | | | | M | | | | | | | | | | | | | | | | | | J 
ml 4 9-1 SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
190 200 210 220 230 240 

250 260 270 280 290 300 

al4 9-l.pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
i M I II I I II : I I I I I I II I I I I I I I I | | | | | | ( i | | | | | | | | | | || | | | j | | | | | | | | | 
ml 4 9- 1 SWVGEKGFIGVAYSDRRDQYGLPAHSHE YDDCHADI I WQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

al4 9-l.pep DYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I ! M I I I I I I I i I I I : I i I I : I : I : I I I I | I | | | | | | | | I I I I I I I I I I I I I I I I 1 I I f I 
ml49-l DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

al49-l.pep RHDEKAGDAVENFFNNQTQNARI ELRHQPI GRLKGS WG VQYLGQKS SALS ATSEAVKQPM 

M I I H I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

IU149-1 RH D E KAG D A VEN F FNNQT QN AR I E L RH Q P I G RLKG S WG VQ YLQQKS SAL S A I S E A VKQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

a 14 9-1. pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGAH 
I I I I I I I I I I I I I I I E I I I M I I I I I I I I I I I I | | | | : | | | | I I I I I I I I I I I I I I I I I I 
ml49-l LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQBCASIQYDKALIDRENYYNHPLPDLGAH 
430 440 450 460 470 480 

490 500 510 520 530 540 

al4 9-l.pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

I I I I I I I i I I I | I | | | | J | M I I I I I I I I I I I M I I I I I I I I I 

ml 4 9-1 RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al49-l.pep S NN I ELALG YE G DRWQYN L AL YRN RFGN Y I YAQT LN DGRG PKS I E DD S EMKL VR YNQS G A 

I I M I I I I I I I t I I I I I I I I I I | | I I I I I | | | | I I I I I I I II I I I I I I 

ml49-l SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 
a149-1.pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRVP

610 620 MO 640 650 660 



al49-X .pep ^f™^™^ 

570 680 690 700 710 7?n 



- . — , 730 740 750 759 

a!4 9-1 . pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
J 1 ' (ill MM flMMI 111 IN | mi mi || if,,, 
ml49-l WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
730 740 750 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 607>:

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCGAA mSJ™£ 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTcS 

IV ^™ GATC CGGCACTGGT CGGGGAAATC CTAGACCTGC TCGGCATCAA 

2 01 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 



251 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA
301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA
351 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGGTGTGC
401 Irrrnrt^Z CCCGGCAAAA CTGACGGCGG AACAATTCGC CGGCCTGCTG
451 nn.n~»~ TG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC
501 °I° CACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG



oi. i x^ULKiAA CGCAACGACG GCTTCAGGCT 

651 GCCCGAAGAC AGCCGCAAGC CGATTGTGAT GATCGGCTCC 

701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA 

751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CcSIcScCT 

801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGC^CAGAT 

851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGcSSc 

901 AAAATCCGCG AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC 

951 GCATATCTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA gSSSE 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA <SSSS^ 

1101 SJSSSS ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 



1051 



This corresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>:

g150.pep

1 ..YCKADPFPAA LLANQKITAR QSDKDVRHIE IDLSGSDLHY LPGDALGVWF

51 DNDPALVGEI LDLLGINPAT EIQAGGKTLP VASALLSHFE LTQNTPAFVK

101 GYATFADNDE LDRIAADNAV LQGFVQSTPI AGVLHRFPAK LTAEQFAGLL

151 RPLAPRLYSI SSSQAEAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE
201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE
251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD
301 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG
351 AEGYLDMLRE EKRYQRDVY*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 609>:

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

III AGCAGGAAAC GGGGCATCTG CAGGACTGCC £££££ 

IV n^ 0 ™ 0 CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 
IV t^^ 0 ^ TGCGM ^ CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 
251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 
301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTTACCT cScJSS 
351 CGAAGGCGAA CCGCCGAAAG AAGCCGTCGT GCTGCACAaI C GC gmS 
401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT SSSSES 
451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG

501 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 

901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG
1251 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT

1401 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT
1451 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC
1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC
1551 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 
1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 
1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 

m150.pep

1 MQNTNPPLPP LPPEITQLLS GLDAAQWAWL SGYAWAKAGN GASAGLPALQ

51 TALPAAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA

101 KNIAGERRLL LVTSTQGEGE PPKEAVVLHK LLNGKKAPKL DKLQFAVLGL

151 GDSSYPNFCQ AGKDFDRRFE ELGAKRLLER VDADLDFTAS ANAWTDNIAA

201 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKAAPFPAA LLANQKITAR
251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI LDLLGIDPAT
301 EIQAGGKMMP VARALSSHFE LTQNTPAFVK GYAAFAHYEE LDKIIADNAV

351 LQDFVQNTPI VDVLHRFPAS LTAEQFIRLL RPLAPRLYSI SSAQAEVGDE

401 VHLTVGVVRF EHEGRARTGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED
451 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLIFGNP HFARDFLYQT
501 EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW QWLQEGAHIY
551 VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF 150.ng) 
from N. gonorrhoeae: 

m150/g150



210 220 230 240 250 260 

m150.pep LLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE

I I I I I I I I I I I I I I I I I I I I I M I I It I I 
g150 YCKADPFPAALLANQKITARQSDKDVRHIE

10 20 30 

270 280 290 300 310 320 

ml50 . pep IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
M I I 1 I II II I I I I I I I ! M I I ! I I II | I | | M | | : | | | | ( | | l | | : | l | |, 11 I I 
g150 IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE

40 50 60 70 80 90 

330 340 350 360 370 380 

ml50 . pep LTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPIVDVLHRFPASLTAEQFIRLL 
g150

ml 50 . pep 
gl50 

ml50 . pep 
gl50 

ml50 . pep 
gl50 

ml 50 . pep 
gl50 



11,11,1111,11:11 : I I I : I I i I i | | f |||:|||: I f | 1 I I I • j ! I I I I I I 
LTQNTPA^KGYATFADNDELDR^^ 

100 110 120 130 140 150 



390 



400 



II III 1 ' 1 ' 1 : 1 1 1 1 1 1 ' 1 1 1 11 : 1 1 M 11 1 H " : I II I H • i I f M I I I I I I 
RPLAPRLYSISSSQAEAGDEVHLTVGAVRFEHEGRARAGGASGFFADRLEEE 

160 170 180 190 200 210 



2DGTVRVFAE 



450 



460 



470 



480 



490 



kndgfrlpedsrkpivmigsgtgvapfra^^ 

250 260 270 



220 



230 



240 
530 



510 520 530 540 550 Sfin 

1 1 1 1 M 1 M 1 1 ' I I M M M M M I M M M I M M M M M M M M I I 1 I I l l i i i I i 
ewqqfakdgflhrydfawsrdqeekiyvqdkireqae 

280 290 300 310 320 330 



570 



580 590 600 

dveaalldviigaghldeegaeeyldmlreekryqrdvyx 
:|||I "MIIII!I MMM MMMMMMfMM 
eveaalldviigaghsdedgaegyldmlreekryqrdvyx 

340 350 360 



370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 611>:

a150.seq

1 atgcagaaca caaatccgcc ATTACCGCCT ATGCCGCCCG AAATCACGCA
51 gctcctgtcg gggctggacg CGGCACAATG GGCGTGGCTG TCCGGCTACG
101 cttgggcaaa agcaggaaac GGGGCATCTG CAGGACTGCC CGCGCTTCAG
151 acggcattgc cgacggcaga ACCTTTTTCC GTAACCGTCC TTTCCGCCTC
201 gcaaaccggc aatgcgaaat CCGTTGCCGA CAAAGCGGCG GACAGCCTGG
251 aagccgccgg catccaagtc AGTCGCGCCG AACTGAAAGA CTATAAGGCG
301 aaaaacatcg ccggcgaacg CCGCCTGCTG CTGGTTACCT CCACCCAAGG
351 cgaaggcgaa ccgccggaag AAGCCGTCGT GCTGCACAAA CTGCTGAACG
401 gcaaaaaagc cccgaaattg GACAAACTCC AATTTGCCGT ACTGGGTTTG
451 ggcgacagct cctatccgaa TTTCTGCCGG GCGGGCAAAG ATTTCGACAA
501 acgttttgaa gaattgggcg CAAAACGCCT GCTCGAACGC GTTGATGCGG
551 atttggactt tgccgccgcc GCAGACGGAT GGACAGATAA TATCGCCGCA
601 ctcttaaaag aagaagccgc AAAAAACCGG GCAACGCCCG CGCCGCAGAC
651 aacgcccccc gccggccttc AGACGGCACC GGATGGCAGG TACTGCAAGG
701 cagacccctt tcccgccgcc CTGCTGGCCA ATCAGAAAAT CACCGCCCGC
751 caatccgata aagacgtgcg CCACATCGAA ATCGATTTGA GCGGTTCGGA
801 tttgcactac ctcccgggcg ACGCGCTCGG CGTTTGGTTT GACAACGATC
851 cggcactggt cagggaaatc CTAGACCTGC TCGGCATCGA TCAGGCAACG
901 gaaatacagg cgggcggaaa AACCCTGCCG GTTGCCTCCG CACTGTTATC
951 ccattttgaa ctcacgcaaa ACACCCCCGC CTTTGTCAAA GGCTATGCCC
1001 cgttcgccga tgatgacgaa CTCGACCGTA TTGCTGCCGA CAACGCCGTT
1051 ttgcaaggct ttgtgcaaag CACGCCGATT GCCGATGTGC TGCACCGCTT
1101 cccggcaaaa ctgacagcgg AACAATTCGC CGGCCTACTG CGCCCGCTTG
1151 cgccgcgcct gtattcgatt TCCTCGTCGC AGGCGGAAGT GGGGGACGAA
1201 gtgcacctga ccgtcggcgc GGTGCGTTTC GAACACGAAG GGCGCGCCAG
1251 ggcgggcggc gcatcgggtt TCCTTGCCGA CCGGCTGGAA GAGGACGGCA
1301 cggtgcgcgt gtttgtggaa CGCAACGACG GCTTCAGGCT GCCCGAAGAC
1351 agccgcaagc cgattgtgat GATCGGCTCG GGCACCGGCG TCGCACCGTT
1401 ccgcgctttc gtccaacaac GTGCCGCAGA AAATGCGGAA GGCAAAAACT
1451 ggctgttttt cggcaatccg CATTTTGCCC GTGATTTTCT CTATCAAACC
1501 gaatggcagc agtttgccaa AGACGGCTTC CTGCACAGAT ACGATTTCGC
1551 ctggtcgcgc gatcaggaag AAAAAATCTA TGTGCAGGAC AAAATCCGCG
1601 aacaggcgga aggactttgg CAATGGCTGC AGGAAGGCGC GCATATCTAT
1651 gtgtgcggcg atgcggcaaa AATGGCAAAA GACGTGGAAG CCGCCTTGCT
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA

This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a>: 

a150.pep

1 MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN GASAGLPALQ
51 TALPTAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA
101 KNIAGERRLL LVTSTQGEGE PPEEAVVLHK LLNGKKAPKL DKLQFAVLGL
151 GDSSYPNFCR AGKDFDKRFE ELGAKRLLER VDADLDFAAA ADGWTDNIAA
201 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKADPFPAA LLANQKITAR
251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI LDLLGIDQAT
301 EIQAGGKTLP VASALLSHFE LTQNTPAFVK GYAPFADDDE LDRIAADNAV
351 LQGFVQSTPI ADVLHRFPAK LTAEQFAGLL RPLAPRLYSI SSSQAEVGDE
401 VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED
451 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP HFARDFLYQT
501 EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW QWLQEGAHIY
551 VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY*



m150/a150 94.8% identity in 599 aa overlap

                    10        20        30        40        50        60
m150.pep    MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS
            ||||||||||:|||||||||||||||||||||||||||||||||||||||||||:|||||
a150        MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS
                    10        20        30        40        50        60

70 80 90 100 110 120 

ml 50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 
I I I I I I I I I I I I I I I I 1 I I I f I I I I I ! I I I I ! M f I I I 1 I 1 I I I I I I I M I I I I I I I I I I 
al50 VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml50 . pep PPKEAWLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 
I I : I I I I I I I I I I I I I I I I t I I I I I I I I I I M I I I I I I I : I I I E I I : | I | | | | | | | | | { I 
al50 PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 50 . pep VDADLDFTASANAWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 
M I I I I I : I : I :: I I I I i I I I I I I I I I I I I | | | | | | | I I I I I I I I I I I f ! 1 I I I I I I I I 
a 15 0 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml50 . pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDFALVREILDLLGIDPAT 
I I I I I I I I I I i I I 1 I II I I I I I I I I I II I I | I I I I I I I ! I I I t I I I E I I I I M I I I I II 
al50 L LANQK I TARQSDKDVRHIE I DLSGSDLHYLPGDALGVWFDNDPALVRE I LDLLGIDQAT 

250 260 270 280 290 300 

310 320 330 340 350 360 

m!50 .pep E I QAGG KMM P VARALS S H FELT QNT P AFVKG Y AA FAH YEEL DK 1 1 ADN A VLQDFVQNT P I 

I I t I I I I : M I II I I I I I I I I | | M | | | | | || : I | | : | | I | I II I I I I : I I I 
alSO EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 5 0 . pep VDVLHRFPASLTAEQFIRLLRPLAPRL YS I SSAQAEVGDEVHLTVGWRFEHEGRARTGG 

: I I M I I I I : II I I I I I 1 I I II I I I I I I I I : I I I I I I I I j M I I : I I I I I I I If I : I 1 
alSO ADVLHRFPAKLTAEQFAGLLRPLAPRLYS I SSSQAEVGDE VHLTVGAVRFEHEGRARAGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

m150.pep ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a150     ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE
                430       440       450       460       470       480

                490       500       510       520       530       540
m150.pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW
         |||||:||||||||||||||||||||||||||||||||||||||||||||||||||||||
a150     GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW
                490       500       510       520       530       540

550 560 570 580 590 600 

m150.pep QWLQEGAHIYVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a150     QWLQEGAHIYVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX

550 560 570 580 590 600 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 613>:

g151.seq

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 

3 01 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 
351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTTATC GACCAGACTT TCGAGTTGTT CGACAACTTG 
4 51 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGTTT 
501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAGC GATATGCGCC 
551 CGCtgttcgA CACCATCCTA AAATACAcgc ctgCACCGAG CGGCAGCGCG 
601 GACGAGCCGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGCTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATTGTGA TTATTTCCGG TATCGAAGAC ATCGGCATCG 
851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 
951 CGCAGGTACA GAAGGCAAAT TCGTGACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTTGCCC TGCGCGTGGA AGACACCGCC 

1051 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GTACCGAGAC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACc cgacgacAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

13 01 GacgCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TCATGACCCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

14 01 GTTcgacgac tacgcgcccg t caAACCCGA TATGCCCGGC CGCCACAACG 
1451 GCGTactggt GtcccaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTGG 
1501 AATCTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAAT CTA 
1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTCACCAATA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCGGTGC GCCTGACCAC GCCGATCAAA CTGAcgcTGG AAGG CGCGGT 
1701 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtccatCC 
1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg t caTTTTAAA 
1801 AagctgGATT AA 
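
The listings in this section split each sequence into 10-residue blocks, with a running position number at the start of every line. As a minimal sketch only (the helper name `parse_listing` is hypothetical and not part of the original disclosure), the blocks can be joined back into one contiguous string before any further analysis:

```python
import re

def parse_listing(lines):
    """Join numbered 10-residue blocks into one uppercase sequence string."""
    parts = []
    for line in lines:
        # Strip the leading position number (digits, possibly split by spaces).
        body = re.sub(r"^[\d\s]+", "", line)
        # Keep letters and the '*' stop marker; drop spacing inside the line.
        parts.append(re.sub(r"[^A-Za-z*]", "", body))
    return "".join(parts).upper()

# First two lines of the g151 listing above.
g151_start = parse_listing([
    "1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA",
    "51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA",
])
print(len(g151_start))
```

Lower-case bases in the listings are folded to upper case here; if the case distinction matters for a given analysis, the final `.upper()` call can simply be dropped.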

This corresponds to the amino acid sequence <SEQ ID 614; ORF 151.ng>:
g151.pep

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE
51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGEVERVLGM VDCVVLLVDA
101 QEGPMPQTRF VTKKALALGL KPIVVINKID KPSARPSWVI DQTFELFDNL
151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA
201 DEPLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI
251 NQLLGFKGLE RVPLEEAEAG DIVIISGIED IGIGVTITDK DNPKGLPMLS
301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA
351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE
401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF
451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW
501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LVVNPLKGKK LTNIRASGTD
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK
601 KLD*
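
The phrase "this corresponds to the amino acid sequence" refers to reading the DNA in frame, three bases at a time, from the initial ATG. The sketch below illustrates that step with the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative choices, and mapping any codon containing an ambiguous base to `X` is an assumption made here rather than something the source states.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons with non-ACGT symbols (e.g. N) are reported as X (assumption).
        protein.append(CODON_TABLE.get(dna[pos:pos + 3], "X"))
    return "".join(protein)

# First 30 bases of the g151 listing.
print(translate("ATGAAACAAATCCGCAACATCGCCATCATC"))  # MKQIRNIAII
```

The ten residues printed agree with the first block of the g151 peptide listing, and the final six bases of the same DNA listing (`GATTAA`) translate to `D*`, matching its last line.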

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 615>: 

m151.seq

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

3 01 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 
351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 
401 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 
501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 
551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 
6 01 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 
851 GCGTAACCAT CACCGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 
951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTG CTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGG CGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

13 01 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 
1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 
1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 
1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 
1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCACTTTAAA 
1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 
m151.pep

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE
51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGEVERVLGM VDCVVLLVDA
101 QEGPMPQTRF VTKKALALGL KPIVVINKID KPSARPSWVI DQTFELFDNL
151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA
201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI
251 NQLLGFKGLE RVPLEEAEAG DIVIISGIED IGIGVTITDK DNPKGLPMLS
301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA
351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE
401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF
451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW
501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LVVNPLKGKK LTNIRASGTD
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK
601 KLD*

Computer analysis of this amino acid sequence gave the following results:
Homology with a predicted ORF from N. gonorrhoeae

ORF 151 shows 99.2% identity over a 603 aa overlap with a predicted ORF (ORF 151.ng)
from N. gonorrhoeae:
g151 KLDX

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 617>:

a151.seq

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGAGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACT AT CT T A AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGTT 

951 GGCAGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGCTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

1701 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

17 51 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ID 618; ORF 151.a>:

a151.pep

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE
51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGEVERVLGM VDCVVLLVDA
101 QEGPMPQTRF VTKKALALGL KPIVVINKID KPSARPSWVI DQTFELFDNL
151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA
201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQVVAVMN HDQQIAQGRI
251 NQLLGFKGLE RVPLEEAEAG DIVIISGIED IGIGVTITDK DNPKGLPMLS
301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA
351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE
401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF
451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW
501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LVVNPLKGKK LTNIRASGTD
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK
601 KLD*

m151/a151 99.8% identity in 603 aa overlap

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 619>:

g152.seq

1 ATGAAAAaca aAACCaaagt ctgGGacttc cCcacccgcc ttTTCCactG
ill Tc.lTlnnr f. CKKCCt SC CCTTTATGTG gtatagCGCA AAAGCCGGCG 
101 GcgataTGCT GcaatgGCAC ACGCGCGTCG GGCTGCTCGT CCTTTTCCTG
151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT

2 01 CTCccgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 
251 gCATTCCCGA ACAtatcCAG CCCGGACACA ACCCCTTGGG CGCACTgatg 

3 01 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 

3 51 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 
401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

4 51 AAGCTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 
501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 
551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 
601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 
651 GTCCTGA 
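
A quick consistency check ties the length of a DNA listing to the length of the peptide that follows it. The sketch below is illustrative only (`orf_summary` is a hypothetical helper, and it assumes the listing is a complete reading frame running from the start codon to the stop codon):

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def orf_summary(dna):
    """Return (starts_with_ATG, ends_with_stop, residues_before_stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    starts = dna[:3] == "ATG"
    stops = dna[-3:] in STOP_CODONS
    # Number of coding codons, excluding a terminal stop codon if present.
    residues = len(dna) // 3 - (1 if stops else 0)
    return starts, stops, residues

print(orf_summary("ATGAAATGA"))  # (True, True, 2)
```

Applied to a 657-nucleotide listing that ends in TGA, such as the one above, the same arithmetic gives 218 residues followed by a stop, which is the length of the overlap quoted for ORF 152.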

This corresponds to the amino acid sequence <SEQ ID 620; ORF 152.ng>:

g152.pep

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TRVGLLVLFL
51 LVFRLCWGIW GSDTARFSRF VRGWAGIRGY LKNGIPEHIQ PGHNPLGALM
101 VVALLAAVSF QVGTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF
151 KLLAVFSAVH IAAVAAYRIF KKKNLVRPMI TGFKYIEGKT SIRFAGKAAL
201 AAALSVAALA AAAILLLS*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 621>:

m152.seq

1 ATGAAAAACA AAACCAAAGT CTGGGACCTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCGTCCCTGC CCTTTATGTG GTATAGCGCG AAAGCCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTCGT CCTTTTCCTG 

151 CTCGTATTTC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

2 01 TTCCCGTTTC GTCCAAGGCT GGGCAGGCAT ACGCGGCTAT CTGAAAAACG 
251 GTATTCCCGA ACACATCCAG CCCGGACACA ACCCCTTGGG CGCACTGATG 

3 01 GTCGTTGCGC TTTTGGCCGC CGTGTCCTTC CAAGTCGGCA CCGGGCTTTT 

3 51 TGCCGCCGAT GAAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 
401 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCACCT CAACTTTTTC 

4 51 AAGCTGCTCG CCGTTTTTTC TGCAATCCAC ATCGCCGCCG TCGCCGCATA 
501 CCGCGTATTC AAAAAGAAAA ACCTCATCCT CCCGATGATA ACCGGCTTCA 
551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCAGGCAA AGCCGCGCTT 
601 GCCGCCGCAT TATCGGTTGC CTCGCTTGCC GCAGCCGCCA TCCTGCTCCT 
651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

m152.pep

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TRVGLFVLFL
51 LVFRLCWGIW GSDTARFSRF VQGWAGIRGY LKNGIPEHIQ PGHNPLGALM
101 VVALLAAVSF QVGTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKIHLNFF
151 KLLAVFSAIH IAAVAAYRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL
201 AAALSVASLA AAAILLLS*

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae

ORF 152 shows 95.4% identity over a 218 aa overlap with a predicted ORF (ORF 152.ng)
from N. gonorrhoeae:

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 623>:

a152.seq

1 ATGAAAAACA AAACCAAAGT CTGGGACTTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCATCCCTAC CCTTTATGTG GTATAGCGCG AAAACCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTTAT CCTTTTCCTG 

151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

201 CTCCCGTTTC GTCCGCGGAT GGTCGGGTAT CAGAGAGTAT ATGAAAAACG 

2 51 GTATTCCCGA ACACGTCCAA CCCGGACACA ACCCCTTGGG CGCACTGATG 

301 GTCGTTGCGC TTTTGGCCGC CGTGTCGTTC CAAGTCGGCA CAGGGCTTTT 

351 TGCCGCCGAT GTAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

4 01 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCATCT CAACTTTTTC 

4 51 AAACTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGNCG TCGCCGCATA 

501 CCGCGTGTTC AAAAAGAAAA ACCTCGTCCT CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 624; ORF 152.a>: 

a152.pep

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KTGGDMLQWH TRVGLFILFL
51 LVFRLCWGIW GSDTARFSRF VRGWSGIREY MKNGIPEHVQ PGHNPLGALM
101 VVALLAAVSF QVGTGLFAAD VNTFSTNGYL NHLVSEHTGS LMRKIHLNFF
151 KLLAVFSAVH IAXVAAYRVF KKKNLVLPMI TGFKYIEGKT SIRFAGKAAL
201 AAALSVAALA AAAILLLS*

m152/a152 94.0% identity in 218 aa overlap

m152.pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI
a152     VNTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAVHIAXVAAYRVFKKKNLVLPMI
                130       140       150       160       170       180

                190       200       210      219
m152.pep TGFKYIEGKTSIRFAGKAALAAALSVASLAAAAILLLSX
         |||||||||||||||||||||||||||:|||||||||||
a152     TGFKYIEGKTSIRFAGKAALAAALSVAALAAAAILLLSX
                190       200       210

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 625>:

g153.seq

1 atggggtttg cttaCAgtat gacgtatatc gaggtCGGGa taccggaggc 

51 ggcatccgtc ctttCgctGC CCGAGATgat gcgcctgatG GTGTTtCagg 

101 at t ATGGTTT TttggcCGAA GTGATGTTTG TGctgaCTTT cGGCGcgcCG 

151 GTTCTGTTtC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCG CTTGA 

251 GGCAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTCT GGTGG CGTAT 

301 ATCAAGCTCT CGTCTGTGGC AAAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AG CATTGGGT GTATTTCCAA ATCGGGCGGC TGACGGGGAA TAATG CGGTT 

451 CAGACGGCAT CGGAAGGCAA AACCTGTTGC AGCCGCTGCC TGTATTTccg 

501 cgacAGTgcc gaatccCCCT GCGGGGTGTg cgGCGcggaA CTgtacggcg 

551 gacggccgaa aagtCTGAGt atttCgtCGG CGTTTCTgac ggcggcggTT 

601 GTTTTGTATT TCCctgCcaa TATCctgccg attaTGAttt cgtccAATCc 

651 tgccgccacg GAGGcCAACA CCATCTTTAG CGGCATCGCT TATATGTGGG 

701 ACGagggcgA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTGC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGGCGGCACG 

801 GTTCGCTTTG CCGGCGGGCG CAAAGAAATT GTCGCACCTC tacCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCacaC TTATGCCG CG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTTT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAACGAAA CGGAAAAATA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 626; ORF 153.ng>: 

g153.pep

1 MGFAYSMTYI EVGIPEAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP
51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQAMMVD VFFVSTLVAY
101 IKLSSVAKVR FGPAFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGNNAV
151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYGGRPKSLS ISSAFLTAAV
201 VLYFPANILP IMISSNPAAT EANTIFSGIA YMWDEGDRLI AAVIFSASIL
251 VPVLKIAAMS VLIAAARFAL PAGAKKLSHL YRITEAVGRW SMIDIFVIII
301 LMCSFHTYAA RVIPGSAAVY FCLVVILTML SAYYFDPRLL WDKRASDGIA
351 FNETEKYD*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 627>: 

m153.seq

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATT ATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCG CTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGG CGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AG CATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGGGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT

901 TTGATGTGTT CGTTCCACAC TTATG CCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TT CAATG AAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

m153.pep

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP
51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQAMMVD VFFVSTLVAY
101 IKLSSVAEVR FGPAFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV
151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV
201 ILYFPANILP IMISSNPAAT EVNTILNGIA YMWDEGDRLI AAVIFSASIL
251 VPVLKIAAMS VLIASARFAL PTGAKKLSHL YRITEAVGRW SMIDIFVIII
301 LMCSFHTYAA RVIPGSAAVY FCLVVILTML SAYYFDPRLL WDKRASDGIA
351 FNETEKHD*

m153/g153 96.1% identity in 358 aa overlap

10 20 30 40 50 60 

m153.pep MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV

Mnmmim iiiiiiiiiiiiiiiiiiiiiimiiiiiitiniiiiiiiii 

g153 MGFAYSMTYIEVGIPEAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV
10 20 30 






gi53 



mi53 .pep 



40 50 60 



70 80 90 100 HO 120 

YAAL I RKQAYPALRLATR VMVRLRQAMMVD VFFVSTLVAYI KLS S VAE VRFG P AF YLM FA 

IIII ' I| I'IIIIMIIIII!1NI||MI!|MIIIIIII!II![|:MIIIIIIII!I 

9153 YAAL I RKQAYP ALRLATRVMVRLRQAMMVDVFF VS TLVAYI KLS S VAK VR FG P AFYLMFA 

70 80 90 100 HO 120 

130 140 150 160 170 180 

ml53 . pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

111111 iiiiiiiiiiii!iiiiiii:iiiiiMiiiiiiiiiiiiii linn mill 

g153 LSVMLIRTSVSVPQHWVYFQIGRLTGNNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE

130 140 150 160 nn i oa 



150 160 170 180 

190 200 210 220 230 240 

ml53 . pep LYRRRPKSLS I SSAFLTAAVILYFPANI LPIMI SSNPAATEVNTILNGIAYMWDEGDRLI 

g153 LYGGRPKSLSISSAFLTAAVVLYFPANILPIMISSNPAATEANTIFSGIAYMWDEGDRLI
190 200 210 220 230 240 

250 260 270 280 290 300 

m153.pep AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII

> ' 1 1 1 1 1 1 i I i i I M 1 1 1 1 1 M I f ^ ] 1 1 { 1 1 = 1 1 ! F i i 1 1 1 1 1 1 i 1 f 1 1 f II 1 1 1 1 1 1 j | 

g153 AAVIFSASILVPVLKIAAMSVLIAAARFALPAGAKKLSHLYRITEAVGRWSMIDIFVIII

250 2fi 0 270 280 290 300 



310 320 330 340 350 359 

m153.pep LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX

> ' 1 1 1 1 M 1 1 1 1 f 1 1 1 1 1 II M ! 1 1 1 1 1 M ! M 11 1 1 ! 1 1 1 1 1 If 1 1 M 1 1 M f 1 1 = 1 1 

g153 LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKYDX

310 320 330 iAn r> c r> 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 629>:

a153.seq

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACCTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGATCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG

7 01 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

7 51 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 
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851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 



This corresponds to the amino acid sequence <SEQ ID 630; ORF 153.a>: 

a153.pep

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP
51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQAMMVD VFFVSTLVAY
101 IKLSSVAEVR FGSAFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV
151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV
201 ILYFPANILP IMISSNPAAT EVNTILNGIA YMWDEGDRLI AAVIFSASIL
251 VPVLKIAAMS VLIASARFAL PTGAKKLSHL YRITEAVGRW SMIDIFVIII
301 LMCSFHTYAA RVIPGSAAVY FCLVVILTML SAYYFDPRLL WDKRASDGIA
351 FNETEKHD*



m153/a153 99.7% identity in 358 aa overlap

10 20 30 40 50 60 

m153.pep MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a153     MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV

10 20 30 40 50 60 



70 80 90 100 110 120 

m153.pep YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGPAFYLMFA

I M I I I I I It I I I I I ! I I I I I I I I I I I I I I II I I | | | | | | | | | I M I I M I I I I I I I I I 
a153 YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA

70 80 90 100 110 120 

130 140 150 160 170 180 

m153.pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a153     LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE

130 140 150 160 170 180 



190 200 210 220 230 240 

m153.pep LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a153     LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI

190 200 210 220 230 240 

250 260 270 280 290 300 

m153.pep AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a153     AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII

250 260 270 280 290 300 



310 320 330 340 350 359 

m153.pep LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a153     LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX
310 320 330 340 350 
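
The "% identity in N aa overlap" figures quoted with each alignment can be checked by counting matching columns. The following is a minimal sketch under stated assumptions: `percent_identity` is a hypothetical helper, both rows are taken to be already aligned and of equal length, and `-` is assumed to be the gap symbol (no gapped rows appear in this part of the listing).

```python
def percent_identity(row_a, row_b, gap="-"):
    """Return (identical, compared, percent) for two aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    # Only columns where both rows carry a residue count towards the overlap.
    pairs = [(x, y) for x, y in zip(row_a, row_b) if x != gap and y != gap]
    if not pairs:
        raise ValueError("no overlapping residues")
    identical = sum(1 for x, y in pairs if x == y)
    return identical, len(pairs), 100.0 * identical / len(pairs)

# Residues 61-120 of the m153/a153 alignment above (one P/S difference).
m153_row = "YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGPAFYLMFA"
a153_row = "YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA"
identical, compared, pct = percent_identity(m153_row, a153_row)
print(identical, compared, round(pct, 1))
```

For the full m153/a153 comparison, a single difference over 358 positions gives 357/358, which rounds to the 99.7% stated above.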

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 63 1>: 

g154.seq

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCTCAAG CACGCGTCCG 

51 CAAAAACAAC accttcctCT CCGCCGTCTG GCTGGTCCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGG CCT 

151 GTGGTTACGC TCTTGATGGA CAGCG CGGAA GGCATCGAAG TCAACAATAC 

201 GGTCATTAAG GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTTA CTGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGcgtAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 



401 ACATCGCTTT TACACCCGGC AAAAGCGGCG AGGCAAAAGA CGTGTTCCAA

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAgcg GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG AccgCATCCT CAACGTcaaC AGCCCTGTTT 

551 TGTATGAAAA CTTTATGGTC GGGCAAATCG AAAGCGCGCA TTTCGAcccG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC AT CAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCAGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAGGAC AG CT TCACGCTTTA CGACAGCCGC AGCGAAATCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GACTGACCGT cggTTCGCCT GTcgaATACA AAGGGCtgaA 

951 TG TcggCATG GTTTCCGATG TCCCTTATTT TGACCGCAAt gacagCCTGC 

1001 ACCtgtTTGA aaacggctgg aTTcccGtac gCATCCGCAT cgagccTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAGCAAAGC AAAGAGCATT GGAAACAACA 

1101 ATTCCAGACG G CCTTAAACA AAGG CCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTG CTGAC CGGCGGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCC 

1201 TCGCCCAAGC TG CGACCGCA TACCGTTTAT GCAGG CGATA CCGTCATCGC 

1251 CACACGGGGC GGCGGTTTGG ATGACTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACaaatT CAACAATCTG CCATTggata aAACCGTTGC CGAATTGAAC 

13 51 GGCTCGCTCG CCGAACT CAA GTCCGCACTC AAATCCGCCA ATGCCGCCCT 

14 01 AAGCTCCATT GacaAACTGG TCGgcaaTCC GCAGACGCAA AACATCCCGA 
1451 ACGAACTGAA CCAAACTCTG AAAGAGTTGC GCATAACCCT GCAAGGCGTA 
1501 TCGcctCAAT CGCCTATCTa cgGAgacgta caAAATAcgc tgCaAAGTTT 
1551 GGACAAAACC TTAAAagacg TtcaACCCGT CATTAACACT TTGAaAGAAa 
1601 aacCCaaCgc actGATTTtc aacaACAGCA GCAAAGAccc tATCCCGAAA 
1651 GGAAGCCGAT AA

This corresponds to the amino acid sequence <SEQ ID 632; ORF 154.ng>:

g154.pep

1 MTDNSPPPNG HAQARVRKNN TFLSAVWLVP LIALIAGGWL WVKEIRNRGP
51 VVTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD
101 VSGLIRSDTQ FWVVKPRIDQ SGVTGLGTLL SGSYIAFTPG KSGEAKDVFQ
151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQIESAHFDP
201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL
251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEIANLPDDR SLYYTAFFKQ
301 SVRGLTVGSP VEYKGLNVGM VSDVPYFDRN DSLHLFENGW IPVRIRIEPS
351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGGK MIELNDQPSA
401 SPKLRPHTVY AGDTVIATRG GGLDDLQVKL ADLLDKFNNL PLDKTVAELN
451 GSLAELKSAL KSANAALSSI DKLVGNPQTQ NIPNELNQTL KELRITLQGV
501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NNSSKDPIPK
551 GSR*

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 633>:

m154.seq

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

2 01 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATC CGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TAT CGACCAA AGCGG CGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

401 ACATCGCCTT TACACCCGGC AAAAG CGACG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 

551 TGTATGAAAA TTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC AT CAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GCCTGACCGT CGGTTCGCCC GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGACG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATACCCGTAC GCATCCGCAT TGAACCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG G CCTTAAACA AAGG CCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGAAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCA
1201 TCACCTAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC
1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC
1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC
1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT
1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA
1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA
1501 TCGCCGCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT
1551 GGACAAAACT TTAAAAGACG TTCAACCCGT GATTAATACT TTGAAAGAAA
1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA
1651 GGAAGCCGAT AA



This corresponds to the amino acid sequence <SEQ ID 634; ORF 154>:

m154.pep

1 MTDNSPPPNG HAQARVRKNN TFLSAVWLVP LIALIAGGWL WVKEIRNRGP
51 VVTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD
101 VSGLIRSDTQ FWVVKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ
151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP
201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL
251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ
301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS
351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA
401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN
451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV
501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK
551 GSR*



m154/g154 97.8% identity in 553 aa overlap



10 20 30 40 50 60 

m154.pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g154     MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE

10 20 30 40 50 60 



70 80 90 100 110 120 

m154.pep GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g154     GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ

70 80 90 100 110 120 



130 140 150 160 170 180 

m154.pep SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN

IIIIIIIIMIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIMIIIIIilllMIIIII 

g154 SGVTGLGTLLSGSYIAFTPGKSGEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN
130 140 150 160 170 180 



190 200 210 220 230 240 

ml 54 - pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

lllllllllllhlllllllllllllllllllllllllllllllllllllllllllllll 
g154 SPVLYENFMVGQIESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI
190 200 210 220 230 240 



250 260 270 280 290 300 

ml54 - pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

M M MMMiM MM M MM M Ml IIMM MM MMMMMi lM MM M I 

9154 KLNSAPLPALLSGAISFDSP KTKNSKNVKS EDS FTLYDSRSE I ANLPDDRSLYYTAFFKQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 5 4 . pep SVRGLTVGS PVEYKGLNVG WSDVPYFDRNDSLHLFENGWI PVR IRIE PSRLE INADEQS 

I MMMIMMM MMIMMMMMIMMMMMMMMMMMMMMM 

gl S4 SVRGLTVGS PVEYKGLNVGMVSDVPYFDRNDS LHLFENGWI PVRIRIE PSRLE INADEQS 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 54 . pep KEHWKQQFQT ALNKGLTAT I SSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

I IIMM Mil III III III M I II I Ihllllltlh Ml III IIIIMIIiMIM:| 

gl54 KEHWKQQFQTALNKGLTAT I SSNNLLTGGKMI ELNDQPSAS PKLRPHTVYAGDTVI ATRG 






                 370       380       390       400       410       420

                 430       440       450       460       470       480
m154.pep  GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ
          |||||||||||||||||  |||||||||||||||||||||||||||||||||||| ||||
g154      GGLDDLQVKLADLLDKFNNLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGNPQTQ
                 430       440       450       460       470       480

                 490       500       510       520       530       540
m154.pep  NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF
          | |||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
g154      NTPNELNQTLKELRITLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF
                 490       500       510       520       530       540

                 550
m154.pep  NSSSKDPIPKGSRX
          | ||||||||||||
g154      NNSSKDPIPKGSRX
                 550



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 635>: 

a154.seq 

   1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 
  51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 
 101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 
 151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 
 201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 
 251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 
 301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 
 351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 
 401 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 
 451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 
 501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 
 551 TGTATGAAAA CTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 
 601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 
 651 ACTGATTCAT TCCGCCAGCC GTTTCTGGCT GGAAAGCGGC ATCAATATCG 
 701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 
 751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 
 801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 
 851 CCAACCTGCC TGATGACCGT TCGCTGTACT ACACCGCGTT TTTCAAACAA 
 901 TCCGTGCGCG GACTGACCGT CGGTTCGCCT GTCGAGTACA AAGGGCTGAA 
 951 TGTCGGCGTG GTTTCCGATG TTCCTTATTT CGACCGCAAC GACAGCCTGC 
1001 ACCTGTTTGA AAACGGCTGG ATTCCCGTAC GCATCCGTAT TGAGCCTTCC 
1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 
1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 
1151 ACCTGCTGAC CGGCAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCC 
1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC 
1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 
1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 
1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT 
1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 
1451 [illegible] CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 
1501 TCGCCTCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 
1551 GGACAAAACC TTAAAAGACG TTCAACCCGT CATTAACACT TTGAAAGAAA 
1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 
1651 GGAAGCCGAT AA 



This corresponds to the amino acid sequence <SEQ ID 636; ORF 154.a>: 

a154.pep 

  1 MTDNSPPPNG HAQARVRKNN TFLSAVWLVP LIALIAGGWL WVKEIRNRGP 
 51 VVTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 
101 VSGLIRSDTQ FWVVKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 
151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 
201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 
251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 
301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 
351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 
401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 
451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 
501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 
551 GSR* 



m154/a154  100.0% identity in 553 aa overlap 

                  10        20        30        40        50        60
m154.pep  MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m154.pep  GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWVVKPRIDQ
                  70        80        90       100       110       120

                 130       140       150       160       170       180
m154.pep  SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN
                 130       140       150       160       170       180

                 190       200       210       220       230       240
m154.pep  SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m154.pep  KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ
                 250       260       270       280       290       300

                 310       320       330       340       350       360
m154.pep  SVRGLTVGSPVEYKGLNVGVVSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      SVRGLTVGSPVEYKGLNVGVVSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS
                 310       320       330       340       350       360

                 370       380       390       400       410       420
m154.pep  KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG
                 370       380       390       400       410       420

                 430       440       450       460       470       480
m154.pep  GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ
                 430       440       450       460       470       480

                 490       500       510       520       530       540
m154.pep  NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a154      NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF
                 490       500       510       520       530       540

                 550
m154.pep  NSSSKDPIPKGSRX
          ||||||||||||||
a154      NSSSKDPIPKGSRX
                 550
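
The identity figures quoted with these alignments (for example, 100.0% in a 553 aa overlap for m154/a154) can be checked directly from the aligned rows. The following is a minimal Python sketch for illustration only; it is not the program that produced the alignments. The function name `percent_identity` and the convention of counting gap columns in the overlap length are assumptions.

```python
def percent_identity(row_a, row_b, gap="-"):
    """Return (matches, overlap, percent) for two equal-length aligned rows.

    Assumption: the overlap length is the number of alignment columns,
    gap columns included, and a gap never counts as a match.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    if not row_a:
        raise ValueError("aligned rows must not be empty")
    matches = sum(1 for x, y in zip(row_a, row_b) if x == y and x != gap)
    overlap = len(row_a)
    return matches, overlap, round(100.0 * matches / overlap, 1)


# Residues 541-553 of the m154 and g154 rows (one substitution).
print(percent_identity("NSSSKDPIPKGSR", "NNSSKDPIPKGSR"))
```

Applied to complete aligned sequences rather than a single row, the same count yields the whole-overlap figure.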



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 637>: 

g155.seq 

   1 atGAAaatcg GtatcCCACG CGAGTCAtta tcCGGCGAAA cccgcgtagc 
  51 ctgcAcgccc gCCACCGTTG CCctgctggg caAactAGGC TTTGAAACCG 
 101 TTGtcgaAAG CGGTGCAggt TTGGCGGCAA GTTTggaCGA TGCCGCTTAC 
 151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGCCTGCCC 
 201 TTTAATTTAT AAGGTCAACG CGCCGTCCGA AGGCGAGCTG CCGCTGCTCA 
 251 AAGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
 301 TTGGTCGAGG CCTTGCGCGC CAAGAAAGTC AACGCGCTGG CGATGGACAT 
 351 GGTTCCCCGC ATTTCCCGCG CTCAGGCCTT GGACGCTTTG TCTTCAATGG 
 401 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 
 451 CGTTTCTTCA CCGGTCAAAT CACTGCCGCC GGCAAAGTGC CGCCTGCGCA 
 501 GGTTTTGGTG ATTGGCGCCG GTGTGGCGGG TTTGGCGGCA ATCGGTACGG 
 551 CAAATTCGCT CGGCGCAGTG GTGCGCGCGT TCGATACCCG CTTGGAAGTG 
 601 GCGGAACAAA TCGAATCGAT GGGCGGTAAG TTcctGAAAC TCGACTTCCT 
 651 GCAAGAATCG GGCGGCAGCG GAGACGgctA CGCCAAAGTG ATGAGCGACG 
 701 AATTTATCGC CGCCGAAATG AAGCTCTTTG CCGAACAGGC GAAAGAAGTG 
 751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CTCCCAAGCT 
 801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGATCC GTCATCGTCG 
 851 ATTTGGCGGC GACGGGCGGC AACTGCGAAC TCACCCGACC GGGCGAATTG 
 901 TCCGTAACCG GCAACGGCGT GAAAATCATC GGCTACACCG ACATGGCAAA 
 951 CCGCCTTGCC GGACAGTCTT CCCAGCTTTA CGCCACCAAC TTGGTGAACC 
1001 TGACCAAGCT GTTAAGCCCG AACAAAGAcg gcgaAATCAC GCTGGACTTC 
1051 GAAGacgtGA TTATCCGCAA TATGACCGTT ACCCGcgacg gcgaaATCAC 
1101 CTTCCCGCCT CCGccgaTTc aggtTTCcgc ccggccgCAG CAAAcgccgt 
1151 ctgaAAAagc cgcGCCTGCC GCCAagcccg AgccGaaacc tgttCCcctg 
1201 tggaAAAaac tcgCGCCCGC CGCcatcgCC GCCGTATTGG tgctgtgGgt 
1251 cggCgcggtc gcacccgcag CATTCTTGAA CCACTTTATC GTCTTCGTCC 
1301 TCGCCTGCGT CATCGGCTAC CATGTCGTTT GgaacgTCAG CCACTCGCTG 
1351 CACACACCGC TGAtgtcggt aaccaaCgcc atctccGGCA tcatggtcgt 
1401 cggCGCGCTG CTGCAAATCG GTCAGGGcaa cggcttcgtT TCgctGCTGT 
1451 CGTTTGTTGC CATCCTGATT GCCGGCATCA ATATCTTCGG CGGCTTTGCG 
1501 GTTACACGGC GTATGCTGAA TATGTTTAAG AAAGGGTAA 



This corresponds to the amino acid sequence <SEQ ID 638; ORF 155.ng>: 

g155.pep 

  1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETVVESGAG LAASLDDAAY 
 51 QTAGATVADK AAVWACPLIY KVNAPSEGEL PLLKEGQTIV SFLWPRQNEA 
101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 
151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 
201 AEQIESMGGK FLKLDFLQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 
251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAATGG NCELTRPGEL 
301 SVTGNGVKII GYTDMANRLA GQSSQLYATN LVNLTKLLSP NKDGEITLDF 
351 EDVIIRNMTV TRDGEITFPP PPIQVSARPQ QTPSEKAAPA AKPEPKPVPL 
401 WKKLAPAAIA AVLVLWVGAV APAAFLNHFI VFVLACVIGY HVVWNVSHSL 
451 HTPLMSVTNA ISGIMVVGAL LQIGQGNGFV SLLSFVAILI AGINIFGGFA 
501 VTRRMLNMFK KG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 639>: 

m155.seq 

   1 ATGAAAATCG GTATCCCACG CGAGTCATTA TCCGGCGAAA CCCGCGTCGC 
  51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 
 101 TTGTCGAAAG CGGTGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 
 151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGTCTGCCC 
 201 TTTGATTTAT AAGGTCAACG CGCCGTCCGA ACAGGAACTG CCGCTTTTGA 
 251 ACGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
 301 TTGGTCGAAG CCTTGCGCGC CAAGAAAGTG AACGCGCTGG CGATGGATAT 
 351 GGTGCCCCGC ATTTCGCGCG CGCAGGCTTT GGACGCTTTG TCTTCGATGG 
 401 CAAACATCAG CGGCTACCGC GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 
 451 CGTTTCTTCA CCGGTCAAAT TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 
 501 GGTTTTGGTG ATTGGTGCAG GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 
 551 CAAACTCGCT CGGCGCAGTG GTACGCGCGT TCGATACCCG CTTGGAAGTG 
 601 GCGGAACAAA TCGAATCGAT GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 
 651 ACAAGAATCG GGCGGCAGCG GAGACGGCTA CGCCAAAGTG ATGAGCGACG 
 701 AATTTATCGC AGCCGAGATG AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 
 751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CGCCCAAGCT 
 801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGCTCC GTCATCGTCG 
 851 ATTTGGCGGC GGCGACGGGC GGCAACTGCG AACTCACCCG CCCGGGCGAA 
 901 TTGTCCGTAA CCGGCAACGG CGTGAAAATC ATCGGCTACA CCGACATGGC 
 951 AAACCGCCTT GCCGGACAGT CTTCCCAGCT TTACGCCACC AACTTGGTCA 
1001 ACCTGACCAA GCTGTTAAGC CCGAACAAAG ACGGCGAAAT CACGTTGGAC 
1051 TTCGAAGACG TGATTATCCG CAACATGACC GTTACCCACG ACGGCGAAAT 
1101 CACCTTCCCG CCTCCGCCGA TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 
1151 CGTCTGAAAA AGCCGTGCCT GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 
1201 CTGTGGAAAA AACTCGCGCC CGCCGTCATC GCCGCCGTCT TGGTACTGTG 
1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTGTTCG 
1301 TTCTCGCCTG CGTCATCGGC TACTACGTCG TCTGGAACGT CAGCCACTCG 
1351 CTGCACACAC CGCTGATGTC GGTAACCAAC GCCATCTCCG GCATCATCGT 
1401 CGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
1451 TGTCGTTTGT TGCCATCCTG ATTGCCGGCA TCAACATCTT CGGCGGCTTT 
1501 GCGGTAACAC GGCGTATGCT GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 640; ORF 155>: 

m155.pep 

  1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETVVESGAG LAASLDDAAY 
 51 QTAGATVADK AAVWVCPLIY KVNAPSEQEL PLLNEGQTIV SFLWPRQNEA 
101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 
151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 
201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 
251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 
301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 
351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 
401 LWKKLAPAVI AAVLVLWVGA VAPAAFLNHF IVFVLACVIG YYVVWNVSHS 
451 LHTPLMSVTN AISGIIVVGA LLQIGQGNGF VSLLSFVAIL IAGINIFGGF 
501 AVTRRMLNMF KKG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

m155/g155  97.9% identity in 513 aa overlap 

                  10        20        30        40        50        60
m155.pep  MKIGIPRESLSGETRVACTPATVALLGKLGFETVVESGAGLAASLDDAAYQTAGATVADK
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g155      MKIGIPRESLSGETRVACTPATVALLGKLGFETVVESGAGLAASLDDAAYQTAGATVADK
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m155.pep  AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR
          |||| |||||||||||| ||||| ||||||||||||||||||||||||||||||||||||
g155      AAVWACPLIYKVNAPSEGELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR
                  70        80        90       100       110       120

                 130       140       150       160       170       180
m155.pep  ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g155      ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA



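                 130       140       150       160       170       180

                 190       200       210       220       230       240
m155.pep  IGTANSLGAVVRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM
          |||||||||||||||||||||||||||||||||||| |||||||||||||||||||||||
g155      IGTANSLGAVVRAFDTRLEVAEQIESMGGKFLKLDFLQESGGSGDGYAKVMSDEFIAAEM
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m155.pep  KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE
          ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
g155      KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAA-TGGNCELTRPGE
                 250       260       270       280        290

                 310       320       330       340       350       360
m155.pep  LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g155      LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT
        300       310       320       330       340       350

                 370       380       390       400       410       420
m155.pep  VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA
          || ||||||||||||||| ||||||||| ||||||||||||||||||| |||||||||||
g155      VTRDGEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPLWKKLAPAAIAAVLVLWVGA
        360       370       380       390       400       410

                 430       440       450       460       470       480
m155.pep  VAPAAFLNHFIVFVLACVIGYYVVWNVSHSLHTPLMSVTNAISGIIVVGALLQIGQGNGF
          ||||||||||||||||||||| ||||||||||||||||||||||| ||||||||||||||
g155      VAPAAFLNHFIVFVLACVIGYHVVWNVSHSLHTPLMSVTNAISGIMVVGALLQIGQGNGF
        420       430       440       450       460       470

                 490       500       510
m155.pep  VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX
          ||||||||||||||||||||||||||||||||||
g155      VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX
        480       490       500       510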



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 641>: 

1 ATGAAAATCG GTATCCCACG TGAGTCATTA TCCGGCGAAA CCCrrrvrrr 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG cSctS^ SgSIcCG 

101 TTGTCGAAAG cggcgcaggt ttggcggcaa gtttggacga tgccgc^Sc 

151 caagcagcag gcgcaaccgt tgccgacaaa gcagcggttt gggStac?? 

201 TTTAATTTAT AAGGTTAACG CGCCGTCCGA AGACGAGCTG COsSSSS 

251 AAGAAGGACA GACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

11, rr^ln^ CCTTG CGCGC CAAGAAAGTG AACGCGCTGG SJSKKJ 

In, ntl^ C ° GC ATTTCGC GCG CGCAGGCTTT GGACGNTTTG TCTTNGATGG 

401 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

451 CGTTTNTTCA CCGGCCAAAT TACTGCCGCA GGCAAAGTGC CGcSgCGCA 



ggttttggtg attggtgcag gtgtggcagg ^g^ggcg atc^acS 



501 

551 CAAACTCGCT CGGCGCAGTG G^G^ ^ACCCG 

601 GCGGAACAAT TAGAATCGAT GGGCGGCAAG TTCCTGAaS JcGAC^cS 

651 GCAAGAATCG GGCGGCAGCG GCGACGGCTA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC CGCCGAGATG AAGCTTTTTG CCGAGCAGGC SgS^G 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CgIScaS 

aSgGCG^ SJES^ ACCCGGCxS 

ATTTGGCGGC GGCGACGGGC GGCAACTGCG AACTCACCAA ACArrrrran 

5° gccggacapt ^ GAAAATC SSSSJ 

A S ^GCAGCT TTACGCCACC AACTTGGTCA 
1001 ACCTGACCAA GCTGTTAAGC CCGA^AG ACGGCGAAAT 
1051 r^™ G TGATTATCCG CAACATGACC GTTACCCGCG ACGGCgS 
1101 CACCTTCCCG CCTCCGCCGA TTCAAGTTTC CGCCCAACCG CAGCAAACGC 
1151 CGTCTGAAAA AGCCGCGCCT GCCGCCAAGC CCGAACCGAA A^S^Sc 
1201 CTGTGGAAAA AACTCGCGCC CGCCNTNATC GCCGCCGTGT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTCTTCG 

1301 TCCTCGCCTG CGTCATCGGC TACTATGTCG TTTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTGACCAAC GCCATTTCCG GCATCATCGT 

1401 CGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

1451 TGTCGTTTGT TGCCATCCTG ATTGCCAGCA TCAACATCTT CGGCGGCTTC 

1501 TTTGTAACGC GGCGGATGCT GAATATGTTT AGGAAAGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 642; ORF 155.a>: 

a155.pep 

  1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETVVESGAG LAASLDDAAY 
 51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 
101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 
151 RXFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRVFDTRLXV 
201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 
251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 
301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 
351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 
401 LWKKLAPAXI AAVLVLWVGA VAPAAFLNHF IVFVLACVIG YYVVWNVSHS 
451 LHTPLMSVTN AISGIIVVGA LLQIGQGNGF VSLLSFVAIL IASINIFGGF 
501 FVTRRMLNMF RKG* 

m155/a155  95.3% identity in 513 aa overlap 

                  10        20        30        40        50        60
m155.pep  MKIGIPRESLSGETRVACTPATVALLGKLGFETVVESGAGLAASLDDAAYQTAGATVADK
          ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
a155      MKIGIPRESLSGETRVACTPATVALLGKLGFETVVESGAGLAASLDDAAYQAAGATVADK
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m155.pep  AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR
          ||||  ||||||||||| ||||| ||||||||||||||||||||||||||||||||||||
a155      AAVWAYPLIYKVNAPSEDELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR
                  70        80        90       100       110       120

                 130       140       150       160       170       180
m155.pep  ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA
          |||||||| || ||||||||||||||||||| ||||||||||||||||||||||||||||
a155      ISRAQALDXLSXMANISGYRAVIEAANAFGRXFTGQITAAGKVPPAQVLVIGAGVAGLAA
                 130       140       150       160       170       180

                 190       200       210       220       230       240
m155.pep  IGTANSLGAVVRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM
          |||||||||||| ||||| |||| ||||||||||||||||||||||||||||||||||||
a155      IGTANSLGAVVRVFDTRLXVAEQLESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM
                 190       200       210       220       230       240

                 250       260       270       280       290       300
m155.pep  KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE
          ||||||||||||||||||||||||||   |||||||| ||||||||||||||||||  ||
a155      KLFAEQAKEVDIIITTAAIPGKPAPKXXXKEMVESMKPGSVIVDLAAATGGNCELTKQGE
                 250       260       270       280       290       300

                 310       320       330       340       350       360
m155.pep  LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT
          | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a155      LHVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT
                 310       320       330       340       350       360

                 370       380       390       400       410       420
m155.pep  VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA
          || ||||||||||||||||||||||||| ||||||||||||||||||| |||||||||||
a155      VTRDGEITFPPPPIQVSAQPQQTPSEKAAPAAKPEPKPVPLWKKLAPAXIAAVLVLWVGA
                 370       380       390       400       410       420

                 430       440       450       460       470       480
m155.pep  VAPAAFLNHFIVFVLACVIGYYVVWNVSHSLHTPLMSVTNAISGIIVVGALLQIGQGNGF
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
a155      VAPAAFLNHFIVFVLACVIGYYVVWNVSHSLHTPLMSVTNAISGIIVVGALLQIGQGNGF
                 430       440       450       460       470       480

                 490       500       510
m155.pep  VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX
          |||||||||||| ||||||| ||||||||| |||
a155      VSLLSFVAILIASINIFGGFFVTRRMLNMFRKGX
                 490       500       510



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 643>: 

g156.seq 

  1 ATGACTTTCG CCTATTGGTG CATTCTGATT GCCTGCCTAT TGCCGCTTTT 
 51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 
101 ACAATCCTCG CGGTTTTCTG GCACATACGC AAGGCGCAGC CGCCCGTGCC 
151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 
201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 
251 CGCTTGCCGG ATTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 
301 ATCGCAGACA AAGCAGCATT GCGCTCGCTG ATGTGGGCGG GCGGATTTGC 
351 CTGCACCGTC GGACTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 644; ORF 156.ng>: 

g156.pep 

  1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 
 51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNTLAGLFI LFRLAFIWCY 
101 IADKAALRSL MWAGGFACTV GLFVAAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 645>: 

m156.seq 

  1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTGCCTAT TGCCGCTTTT 
 51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 
101 ACAATCCGCG CGGTTTTCTA GCGCACACGC AAGGCGCAGC CGCCCGTGCC 
151 CACGCCGCAC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 
201 CGTTTTGACG GCACACGCAA CCGGCAATGC GGCGCAATCG ACCATCAACA 
251 CGCTTGCCTG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAT 
301 ATCGCCGACA AAGCCGCTAT GCGCTCACTG ATGTGGGCAG GCGGATTTGC 
351 CTGCACCGTC GGGCTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 646; ORF 156>: 

m156.pep 

  1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 
 51 HAAQQNGFEA FAPFAAAVLT AHATGNAAQS TINTLACLFI LFRLAFIWCY 
101 IADKAAMRSL MWAGGFACTV GLFVAAA* 
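
The correspondence between each DNA sequence and its amino acid sequence can be reproduced by conceptual translation. The sketch below is illustrative only and is not the software used to prepare this listing. It assumes the standard genetic code (whose codon assignments the bacterial translation table shares); the helper name `translate` and the use of `X` for codons containing ambiguous or unreadable bases are assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces and case ignored) up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X (assumption).
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# First 30 nucleotides of m156.seq give the first ten residues of m156.pep.
print(translate("ATGACTTTCG CCTATTGGTG TATTCTGATT"))  # MTFAYWCILI
```

Translating the whole of m156.seq in this way gives the 127-residue sequence listed as m156.pep, ending at the TGA stop codon.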

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m156/g156  96.1% identity in 127 aa overlap 



                  10        20        30        40        50        60
m156.pep  MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
g156      MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA
                  10        20        30        40        50        60

                  70        80        90       100       110       120
m156.pep  FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV
1 1 1 1 1 1 1 1 1 1 1 1 1 N I I : I •• I : I I I I M I I I I II I I I I I I I I I I I : | | | | | | | | | | , | | 
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